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MAXIMA AND MINIMA: THE SEMI-DEFINITE CASE 
By T. W. CHAUNDY (Ozford) 
[Received 17 July 1932] 


1. Tue theory of the maxima and minima of an analytic* function 
of a single (real) variable is elementary and complete: we evaluate 
the sequence of expressions D”f(x) at the stationary point under 
review; if the earliest term of the sequence which does not vanish 
there is positive and of even order, the stationary point is a minimum. 
This condition is both necessary and sufficient. 

When, however, we turn to functions of two or more independent 
variables, this satisfactory state of affairs ceases. The elementary 
theory is complete for the case in which the Hessian does not vanish 
at the stationary point: for the case of the vanishing Hessian it does 
little more than indicate a procedure. 

For instance, with f(x,y), we have, in the neighbourhood of a 
stationary point, 

8f = firx(8%)?+ fry 3x8Y + fyy(Sy)? 
to a first approximation. The elementary theory covers the case in 
which this approximation is sufficient to determine the sign of 3f. 
If, however, the Hessian 
H (zx, y) = Senduy—Fey 

vanishes, the quadratic in dx, dy is semi-definite, i.e. never actually 
changes sign but is zero for a certain value of the ratio 5a/5y, and we 
must then proceed to a closer approximation to decide the sign of 5f. 


2. If we interpret f(x,y) = constant as a plane curve, the sta- 
tionary point corresponds to a double point; a maximum or minimum 
appears as an isolated (conjugate) point, whereas a node with real 
branches represents a point neither maximum nor minimum. The 
‘semi-definite’ case corresponds to a cusp, and further examination 
is, in general, necessary to show whether this cusp will appear in the 
figure as an isolated point or as a double point with real branches. 

In practice, methods of successive approximation, such as belong 
to the technique of curve-tracing, will usually be found adequate. 

* We need some condition to ensure that a sufficiency of derivatives exist 
and do not all vanish. The analytic condition, though too stringent, is con- 


venient. 
3695.4 B 





2 T. W. CHAUNDY 

Scheefer* and Dantschert have systematized this method of attack, 
showing, in particular, that the process succeeds in a finite number 
of steps. 

3. In the present note I give explicit criteria for a stationary point 
to be a minimum,{ in a form analogous to the elementary criteria 
for functions of a single variable. That is to say, I define a sequence 
of expressions H,(x,y) to be evaluated at the stationary point: at 
a minimum the first of them which does not vanish must be positive 
and of even order. Unhappily the sequence is not unique,§ though 
we may select any preferred variety. 

[ discuss in detail only the function of two variables. The exten- 
sion to many variables and to implicit functions is not difficult. 
I consider only functions which are differentiable as often as we 
need. I restrict myself naturally to the ‘semi-definite’ case of vanish- 
ing Hessian. I go further and consider only functions whose first 
approximation at the stationary point is actually quadratic, ex- 
cluding, for example, such a case as 

f(x,y) = (dp, 41, Ag, Az, Uy, M5, Ag)(x, y)® 
at the origin. For this confronts us with the problem of determining 
convenient conditions that the sextic have no real roots, and this is 


a problem which, pace Sturm, is still unmanageable in practice. I 
confine attention, accordingly, to functions whose critical behaviour 


depends only on quadratic forms. 

For a function f(x,y) I establish the following as my fundamental 
theorem: 

THEOREM 1. A stationary point at which the Hessian vanishes is 
a minimum of f(x, y), if and only if the first term of the sequence H,,(x, y) 
which does not vanish at the point is positive and of even suffix, where 


ae gt ; a ‘te 
1, (x, y) . 1? (fe r — See: , rq (fn a Jay 2 f(x, Y); 
: y Oa dy) } 
p, q being arbitrary parameters independent of x, y. 


* Math. Annalen, 35 (1890), 541. 

+ Math. Annalen, 42 (1893), 89 and 51 (1899), 227. See also Valiron, 
Nouv. Ann. (4) 20 (1920), 41. 
{ Henceforward I shall disregard maxima as being sufficiently included in 
the theory of minima. 

§ This is not surprising, since we are not interested in H, unless H, = 0 
(r <n). These equations give relations between the derivatives of order less 
than n, which may be used to modify the form of H,,. 
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This theorem must be implicit in the theory of Scheefer and 

Dantscher, since I introduce no new analytical principle. The dif- 

ference is essentially one of algebraic rearrangement, and for that 
reason I develop my argument independently. 


4. I base my discussion on an examination of f(x, y) along ‘curves’ 
2 = x(t), y= y(t) through the stationary point: this helps, inci- 
dentally, to bring out the analogy with the single-variable theory. 
It is, of course, clear that if f(x, y) is a minimum at the point, then it 
is a minimum along every curve through the point, since the whole 
includes the part. I use the converse of this in the following stricter 
form: 


LemMa 1. f(x, y) is a minimum at P, if it is a minimum at P along 
every curve x = x(t), y = y(t) through P, such that at P 

(i) the derivatives D™x, D™y exist for every m not exceeding some 

specified r; 

(ii) Dx, Dy are not both zero. 

| Here and afterwards D denotes d/dt. | 

The proof depends on showing that, given a set of points having 
P as a limiting point—here a set of points at which f has a value 
not exceeding its value at P—we can describe a curve of the type 
specified in the lemma which will pass through a sub-set of the points, 
this sub-set also having P as a limiting point. I suppress the proof 
since the result of the lemma in this more general aspect is not 


unexpected.* 


5. I shall write 


‘ ‘ a) ae " - rs . m /yy' 
,, 0, for 0/éx, @/éy, D,, for the operator D™/m!, 


Lins Ym for D,, x, Dry, fing for 6” *4f/Cax?Oy?. 


- 
We are to think of D,f as a polynomial in 2,, ¥;,..., 2%, y, Whose 
coefficients are partial derivatives of f, and so functions of 2, y. 
These coefficients are to be evaluated at the assigned stationary point 
(2%, Yo), and 2, y accordingly disappear as variables, unless the con- 
trary is stated. I shall then make frequent use of the following 
lemma: 


* In all probability it is somewhere already in print in some equivalent 
form. 
B2 
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Lemma 2. If we regard x, Y,..., X,, y, a8 all independent, then 
C 


Ox (Duin ) eal Made (m-+-n"< r). 
To prove this observe that, by Taylor’s theorem, D,,,,,f is the 
coefficient of ¢”*” in 
¥ {(ay t+... a, 07)0,+ (ys t+... +y, t7)2,}*f/8!. 
s=0 


Differentiation in ,, gives 


” ze {(at+...+%,0)0,+(yyt+...+y,l)é,} 40, f/(s—1)!, 


in which the coefficient of ?”+” is D, f,. 
6. It is convenient to impose 2 homographic transformation on 
the variables x, y, so that we may have at the stationary point 
Joo = 1, fu = 9, So2 = 9. (1) 
The critical direction in which alone the semi-definite quadratic 
 _ -\2_1| OF . 3 2 
D°f = f9(Dx)?+ 2f;, DxDy+foo(Dy) 
actually vanishes is now Dz = 0. We may define it, in view of (1), as 
Df, = 0. 
More generally we shall have to examine the behaviour of f along 
critical curves whose derivatives satisfy the conditions 
Df,=90 (8=1,...1). (2) 
The highest derivatives involved are x,, y,. By Lemma 2 
c : . c ; 
=~ (D,fr) = foo = 1, = (Dr fz) = fu = 9, 
OX, OY, 
so that D,f,, = 0 may be written in the form 
ae |) eee ty Meee | (3) 
Hence we may use equations (2) to substitute in succession for 
2X, Lp—4s-+-» Ly, leaving at length only the variables y,_,,..., y;. The 
result of such substitution in an expression F I denote by [F',. 
The substitution itself I call X,, and a curve whose derivatives satisfy 
the relations (2) I call a curve C.,. 
7. Now Lemma 2 at once gives 
é : 0 : 
— (Dpfz) a ee (p fm q > m), 
Yq CYo—m 
and my first step is to show that this relation persists when we have 
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applied the respective substitutions X,,_,, X,,_,,-; to the two sides, 
i.e. to show that 


Oo 
=—[Dpfzl,_ cig 


OY, 


é 
OYq- m 

Now y, enters [F'|, both directly and also through every sub- 
stitution 

"7 x, = A,(x,_1, 


—[D,- mz], —m-1 (p > q z= m). 


wey U5 Yos--sy Y;) (r>82> q); 


or more precisely through 


Xs [A,],_.- 


Thus 
: s<p-1, , r 
oO o o 
) = | —_(p) tn SP _— A 
2 ip igs Pofed| + D [Pole] ae la (5) 


If q = p, we prove our result (4) at once, for there are now no terms 
in ¥ and we have merely 


C a] 
—|D = |—(D 
c ve rbely-s lame? rfa)| 


-| of (Dy-mbe)| , by Lemma 2, 


PY n—m p-1 


F,) 
? E (Dy-mf)| ? 
CY p—m p-m-1 


since now p—m—1 is the highest suffix of xz. Thus 


ra) 
= [Dpfely-a = [3 
ee CY n—m 
If g < p, I use an inductive argument: fixing qg, I show that (5) is 
true for p, if true for p—1. Now by the definition (3) 


A, = 2,—D,f,, 


(Dp-mfe)| (6) 


rap) 
Yp p-m-1 


8 


and so ay! oh. = i. D.f,|,_ 1 


o 
OYo—m 
by the hypothesis above, since s < p—1. 


Again, in analogy with (6), 
0 
-(D 
Ee ( p mf) 


[D 8— natal, m-1 


p-m-1 
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Thus, by (5), (6), (7), (8), 
as (Ppfa) =—2 | (Pym) 


9Yg -9Yg—m p-m-1 


p-1 


s—-msp-mM Ir Q 
nae = |: (Dy-mfs) 
OX. ; 


“sm 


C 
| D, ee ore 


OY, 
s—-meq-m p-m-1 Yq m 


| (Dy-nfa) 


a) 
ae Yq m p-m-1 


Thus the induction is established, and, since (5) is true for p = q, it 
is proved for any p not less than q. i 


In similar fashion 


‘ ‘ 8p ‘ 
c é ~ c Cc 
—Pofl, =|— Of] +>[- ON] od. 
dy,” -P “"* pn Siler, » Ya s—1 
and it follows immediately that 
c ’ C > 
ae | dnd | ‘ [Dp mf | ? (p ; P m). 
c Yq ™ c Yq m en 


8. Now let us examine the hypothesis that simultaneously 
. 
-[D,f], = 0 
OY, * 
Then, by (9), 


é 7 _ é 
ay. Pot, = gy lPo-rssf Ir 


0, by the hypothesis, if p >r > 1. 


Thus [Df], is a function of y, alone. Dimensional considerations 
show that it is a pure power of y,: more precisely, we write 

[Df], : H,,(X9, Yo)y?; 
where H,, involves only partial derivatives of f evaluated at the 
stationary point (%», Yo). Hence, if H,,(x», yp) = 0, we shall have 


(11) 


> Cc . 
[Df |, = 0, iy, Pod J, = 0, 


and we can extend the hypothesis (10) up to s = p. More generally, 
if we have the set of conditions 
Ai(%,Yo) = 9 (r= 1,...,2n—1), 


we can deduce inductively that 


H. 


2n 


(%, Yo) > 9, (12) 


[D,f],=9 (r=1,...,2n—1),  [Danf],, > 0. (13) 
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9. I now show that the conditions (12) are sufficient to establish 
a minimum of f(x,y) at the stationary point (2p, ¥»). 
In the first place, dimensions show that D,,,,/, D.,f are both linear 
IN Xp44)-++)%a,- By Lemma 2 
c ~ = 
Ox (Doiaf) = D,-sishe (7 >8 D1). 
“r+s 
Thus the substitution X, clears D,,,,f of %,,,,...,%,, as well as 
C15... t,: that is to say, 
| Dz, + if |,. Y Seng [Do, + if |. 
Similarly [D2f |, = [Daf ],- 
Moreover, D,,f is quadratic in x,, and by Lemma 2 


“ (D,,f) = D,f, = %,—A,, by (3). 
c vw, 
Thus D.,f = 4(x,—A,)?+B,, 


where B,, like A,, involves 2,,..., %,_, but not 2,,..., %,, and so 


[Df ],_, = #2,—[4,],_,?+1[4,]_,- 
The further substitution x, =[A,] | gives 
[Df], = [B,]_, 
and so | Dz, f | | D,f 1+ 3{2,- [ A, |. a (14) 
Now the curves C, are a sub-class of the curves C,,_,, and (14) shows 
that, along those curves C,_, that do not also belong to the sub- 


( lass Cc. 


|Dz,f q "es > 0, if [D,,f I. = 0. 


[Df |. : [Df |. - 0, by (13). 


Thus, along a curve C,_, (r <n) that is not also a curve C,, the 
earliest derivative of f which does not vanish at the stationary point 
is [Dp,f ],_,, Which is positive and of even order: along all curves C,,, 
the earliest derivative which does not vanish at the stationary point 
is [D,,,f |, which again is positive and of even order. 


10. We have now accounted for all curves through the stationary 
point in which D?"x, D®"y exist and Dz, Dy are not both zero. Along 
all these curves we have shown that f(x, y), under the conditions (13), is 
a minimum. Hence, according to Lemma 1, conditions (13) suffice to 
prove f(x,y) a minimum at the stationary point. 
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Conversely, if along some C,, 
re] = -< nN— 
[Df], = 0 (r < 2n—1), 
then, since f has to be a minimum along this curve, we must have 
[Df], > 0. Hence necessarily for a minimum the earliest H,(x9, ¥) 


which does not vanish must be positive and of even suffix. 

We have therefore proved the first half of Theorem 1 by estab- 
lishing the characteristic sequence H,,. We have still to consider the 
explicit determination of H,,(2%9, Yo). 

11. We no longer restrict ourselves to the stationary point 
(9, Yo) but consider the general functions H,,(x,y). The substitution 
x, = A, appropriate to the stationary point must accordingly be 
replaced by its complete form 

DA fr) = foo%+huyr—Ar = 9, 
or, let us say, Df) = feo(t%-—C;,), (15) 
where » = OF ,45000) 245 1 Y13 X,Y). 

The substitution in this complete form is, I shall show, unaffected 

by differentiation. The differential operator itself, 

D = 2,0,+Y9,4,, 
admits, of course, the substitution X,. For brevity I shall denote 
the transformed operator by D,, i.e. 

D, = C,4,+9,4,. 
I then prove that, for any polynomial ¢, 

[Do(a,,%-1--)],, = Dill$(,, %—-1---)] }. 
The left-hand side of (16) is* 
[r+ Ve br tre, Pot | = (V+ VLC ah Ls), 771], b2], + 
The right-hand side is 
fr “fy Y { 
D, dC,],_ LG, 1], a} = D,{{¢ |, [i], +Dif{[C,-1],_.} [ge], sone 
I have then to prove that 
(s+1)[C, 4], D,i[C,],_,} (17) 

for s = r,r—1,.... I prove it inductively. 

Assuming then that (16) holds up to r—1, we have in particular 

[DG(x,~1,%,-25-++)], = Dy C(t,-1 %-2,---)],_s} 
i.e. briefly [DC,], = D,{[C,]_,}- (18) 


* I write ¢, for the partial derivative of ¢ in its first argument, and so on, 
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Now, from (15), 
(7+1)foo(%-+1—C,+1) = Dl foo(x,—C,)} 
= foo{(r+1)x,,,—DC,}+ (x,—C,)Dfoo. 
The substitution X, gives 
(r+1)[C,.1], = [DC,]., 

i.e., with (18), (r+1)[C,41], = Di{[C,],_,}- 
Hence, if (17) hold up to s = r—1, it holds for s = r also, and the 
induction is established. Since (16) obviously holds for r = 0, then 
(17) holds for s = 0 and I have my starting-point and (16) is 
proved universally. 

12. Applying this result to D,, f, we have that 

(n+1)A, yf? = (M+1[Drsaf), 
= DD, f),} = PiAny?)- 
Now D(f,) = foo%itSir 41 = foo(%1:—C,), by definition of C,. Thus 
we have, for D,, 
= (Ys/foo)( Foo %y—hir @x); 

and so (n+1)A Yt = (Ys/foo)(Fo0&y—fir &2)(An 91 )- (19) 

We reach H,,,, only when H,, vanishes. The effect of the equation 
H,, = 0 and the definition of H,,,, by differentiation of H,, are there- 
fore not affected, if we multiply H,, by any factor which has finite 
first derivatives at the stationary point, and is not zero or infinite 
there. It is accordingly sufficient to define the sequence H,, by the 
recurrence-formula 


a tl ~ = (fog 2 y —f, 0,)H, 


and so to arrive at the definition 
H,, = (foo y—Sia é,) S, (20) 


where after differentiation we put fx) = 1, f,, = 0. 


13. This is the formula of Theorem 1, but restricted to the special 
case in which 
foo = 1, fu = 9, fon = 0 
at the stationary point. To include the general case in which at the 
stationary point 
H=0;  feofog 49 (> 0 for a minimum), 
we swing the axes by the linear transformation 


x= pX+p'Y, y = qX+q') 
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where p, p’, g,q¢' are constants such that at the stationary point 
Sixx = (pe,+9e,)*f = 1, fry = (v'2,4+-9'6,)7f = 
Sxy = (P0,+4y)(P'e, + Oy)f = 9. 
These equations give at the stationary point 
Soo/? = —fulP'Y = So2/P” 1/(pq'—p'q)”, 
and at any age 

fxx°y—Ixy = (p'q—pY KPC Sir x —F20 %y) +4 So2 &x—fir Sy) 

Thus ct la we may write 

H, (P( fir 2x —Fo0 Oy) +4 foz O2—Sir Oy )}'F- (22) 
Now equations (21) fix the ratio p’/q’, but the ratio p/q is left unde- 
fined, provided only that p’g—pq’ ~ 0. In geometrical language one 
of the new (X, Y)-axes is fixed as the cuspidal tangent, the other is 
at our disposal (except, of course, that it must be distinct from the 
prescribed axis). With this trivial exception we may therefore regard 
p,q in (22) as ‘arbitrary constants’ and expand in powers p‘g”~*. 
This gives us n+1 coefficients, which are all alternative forms for 
H_,,: they could be proved equivalent by use of the equations H, = 0 
(ry <n). On the other hand, by giving p,q special pairs of values 
not proportional to p’, q’ we obtain infinitely many varieties of H,,, 
but at most n-+-1 of these varieties are linearly distinct. The proof 
of Theorem | is now complete. 

From the theoretical point of view we must remark that the satis- 
faction of the sequence of conditions for a particular p,q is sufficient 
for a minimum: the satisfaction of the conditions for every p,q 
(p'g—py’ & 0) is necessary for a minimum. In particular, then, their 
satisfaction for any p,q secures their satisfaction for every p,q. We 
may also note that the expressions H,,,, regarded as polynomials in 
p,q, must maintain an invariable sign, under the given conditions, 
for all real p,q: exceptionally they all vanish when pq’—p’q = 0. 

14. We are left, as we have said, with a wide variety of choice 
for H,,. For instance, we may write symmetrically 

H, — {V/(fo2dé x -I(foo)@y}"f; 
in which every H of odd order is irrational. 

The evaluated forms are not simple: I collect the first three* of 
them below. 


* We ignore H,, H,. Actually H, is zero, H, is the Hessian. 








MAXIMA AND MINIMA: THE SEMI-DEFINITE CASE 


At the stationary point write 
ia= @#, fu =, fea = ¥. 


Then we have 


H, = (b@,—aé,)*f; 
H, (b2,,—aé,)*f+-3{(b0,—a?,) foo. (bé, —a@,) foo—| (b0,,—aé,)f,4}"}; 


H,, = (b0,—aé,)®*f+-5{(b0,—a2,,)*fo9 . (62,,—0,,) fos — 
— 2(b0,—ad,)*f,;. (60,,—a0,)f y+ bé,—a0,)*foo- (b2,,—a0,,)fa9}- 
. The foregoing theory can be extended to functions of more 


15 
two independent variables. If, for brevity, we write H,, of 


than 
Theorem 1 in the symbolic form 


H, a, 4% 0\"f 
rx Sry —4 
Sey Svy p 
with the convention that the operators @,, @, have no effect on the 
other elements of the determinant, then the analogous form for three 
independent variables x, y, z is found to be 
H,, a & & 6%, 
Frx Sey Ses P 
Sua Sey Sus q 
fez Sue Sea * 
where p, g, 7 are arbitrary constants, and so forth. 

Here again the theory is effective only when df is quadratic at the 
stationary point. It fails, if every first minor of the Hessian vanishes 
at the stationary point—as is otherwise obvious, for then every H,, 
vanishes identically. 

16. The theory extends also to functions defined implicitly. Thus, 
for the minima of f(x, y,z) under the condition 4(x, y,z) = 0, we find 
that, symbolically, 
H,, = 0, dy é, 0 ol, 

Frx—Mbrx Fry—Mry See—APrz ba P 
Sey < bry Say = Ad yy Svs o. Ady by q 
, —Ad,. Sz—Abyz Jua—War $. r 

p, py ¢: 0 0 


where A is any of f,/¢,, f,,/¢,,f./¢,, Which are all equal at the stationary 


point. 








A CONTRIBUTION TO THE THEORY OF THE 
STRAIGHT BEAM 


By M. REINER (Easton, Pa.) 
[Received 3 June 1932] 
1. THE problem of the straight beam can be stated as follows: 
A straight prismatic beam (see Fig. 1) is subjected to the action 


of 
(i) its own weight, the components of which are 


X = —ysina, Y = —ycosa (1) 


per unit volume, y being the specific weight of its material ; 























Fic. 1. W, and W, are the x and y components of the resultant of all surface 


forces acting on the prismatic boundary-surface, the distribution of which is not 


shown in the figure. 


(ii) such given surface stresses X, ,Y,, Z, as have the following 


resultant forces and moments: 


(a) on the terminal section z = 0, 
the longitudinal force N°, the transverse forces W{, W!??, 
the bending moments VM“, M\, the torsional moment MQ; 
(b) on the prismatic boundary surface, 
the longitudinal force W, per unit length, transverse forces 
W,, and W, per unit length, torsional moment M, per unit 
length; 
(c) on the terminal section z = z, 
forces and moments analogous to (a). 
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The beam is in equilibrium. This means that the forces and 
moments under (c) can be calculated from the rest. The problem is 
to find the deformation of and the stresses in the beam. 

This problem was solved with regard to the action of forces and 
moments specified under (ii) (a) by St. Venant.* We therefore need 
not consider this part of it here. With regard to the action of forces 
and moments specified under (i) and (ii) (b) the problem was attacked 
by J. H. Michell} for the case when W, is equal to zero and W,, W,, 
and M, are independent of z, i.e. for loads normal to and uniformly 
distributed along the axis of the beam (but of arbitrary distribution 
over its boundary). He did not carry it to explicit solution, but only 
reduced it to a problem of plane strain with body-forces. An explicit 
solution was derived independently, on the semi-inverse method, by 
the present author{ in respect of the action of M,. In the present 
paper an explicit solution of the “Michell problem’ is given in respect 
of the action of W, and W, by reducing it to a problem of plane 
strain without body-forces. 


2. Michell had assumed the stresses to be quadratic functions of 


the z-coordinates, i.e. 


q 70 7 (1 day ”(2)52 
X, = XO4XWz4 X22 


Y, = YOLYW24 Yer? 


where X“), X, X®)... are functions of x and y only. 


Because of the strain-stress relations 


ee ei 
= 5{X,—o(¥, + Z,)} 


* Mém. des savants étrangers, 14 (1855); J. de Math. (Liouville) (2), 8 
(1863). 

+ Quart. J. of Math. 32 (1901), 28-42. 

{ Zeitschr. f. angew. Math. u. Mech. 5 (1925), 409. 

§ The notation follows that used by A. E. H. Love in the Treatise on the 
Mathematical Theory of Elasticity. 
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we have 

— ef) + vol z = a reel 22 
9 


— g(0)_1 (1) +_1 g(2) 5 
or Cuz ! Cuz © Cuz wid 


i.e. quadratic functions as well, where 


e(9) — 4 rx. o( Y (0) | Z)} 
a E a v 2z 


= 1 Yo 
Bb” 
Let us now assume the 2 and y axes of the coordinate-system to 
be principal axes of inertia of the cross-section, i.e. 
| | x dady = | | y dxdy = || xy dady = 0. (6) 
Then Michell derived the following expressions for the deformation- 
components 


e) = —KX 
e(2) 
uy 





»(0) eccsjae cease ee 1 ags2\ 1 Dee? ay’ 1 Bos) 1 7} 
Con = €g— Kg U— Kg YH 2Ko(x xy*)4 2K3(x +2x*y)-4 ™1o | (9)* 
, 
»(0) ,(0) 
Cor = Coa 0 
* These expressions have been specialized to exclude the St. Venant case 
and the m, case. The exclusion of the St. Venant case means that the 
forces and moments acting on the terminal section z = 0 and as specified 
under (a) are assumed to be zero. 
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where Ky, K}, T,, €9: Kg, Ky are constants, ¢ is the torsion-function, and 
y and y’ are the two bending-functions of the cross-section, i.e. 

V7 = 0 Vx = 0 V2x’ = 0 } 
, & @ P: (10) 
VF oe 
ox oy” 


For the constants he derived the following expressions: 


W,— yA sin « , _ W,—yAcosa } 
Ky = a a. = ap | 
2EI, 2K I, rs 
2» Co/C,— 2x, C4/Cy 
where 
A ( ( dady * [ ( y* dady I, = ( ( x® dady 


( d tale ( $ ae v)y| dady 
oy CX 


} 
| 


.. (12) 


(X94 x Loa?+ (1 Joy? y dey 
\oy ° lox - a 


| 
| 
| 
| 
| 
| 
| 
| 





ay ae ; 
(Ox » a 9 2\, {9X 179 x ee 
A 4 Loy? bo)xu*\a—{“ + (2+o)xyly| dady | 
(cy | COX j J 
Michell did not derive explicit expressions for e%), e, e®, or X®, 


(0 ™(0) . - e al 
Y‘), X‘, nor for €9, Ko; Ko: 


3. Such explicit expressions can be derived as is shown in the 
\ppendix. The results are 
C =u 
oy? 
9 r5 ’ 2 
2k x+ pox? + (1—do)ay"}—2yxs{x’ + (1+ 40)(2*y— hy*)} 
9 
e2 
— yy COS a —p7,(d-+xy)— 
72 


— ya sin a —pt,(d—xy)— 


1 4 Org 


QuKo{ y+ (1+ 40)(ay?— }a3)}— 2usf{y’ + doy + (1—}e)a*y} 
El eg—(kg tk 2 27)x— («p+ 27)y]+ 
+ Quy 6+ 2Ko(x-+-ay?) + 23 (x’ +-a2y) |+ 
Lol V2b-+-y(x sin a +y cos «)+ u(x, 23+-«} y*)| 


. *) + 2x, EK +(2+ apny | 4 


y ' 
cy 





+244] 7 + joy? +(1—Jo)e*] 
"Lay © * J 
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Z. = 


ie (2 — v) % 2a + for 
OX fad 


oa 
- 
y Ow 
x,=--2 
Oxdy 
where % is the Airy-function which has to satisfy the partial dif- 


ferential equation V2(V2y) = 0. (14) 


The constants €), x, and x, can be obtained by making ff Z, dxdy, 
ff Z, 2 dady, {{ Z,y dady vanish for z = 0. The results are: 

EA 
+2n(r | h dxdy +2, | | x dady +2«} | | x’ day) | 
i 
El, 


+2 (7 | du dady +2«, [ [ x dady +-2«; | xa day) + | 


le [ [ V7 dady +- 


€o 


: = V2ba dicdy + 


Ke = 


~@ 


. +oysinal, +4 pl 22+ Fopks Ia 
Ko = zs le | V2dby dady+ 
+2yu (n dy dady +2x, [ [ xy dady + 2x; | [ xy dedy) + 


. ny 





+-oy cos « I,,+-4«) pl 2.+ FopKs I,| 


where 

Tap = || oy*dedy, I= || x*dady, I= || y* dady. (16) 
The strain-components can be easily calculated from the foregoing 
expressions. 

The boundary conditions for ¢, x, x’ are those of the St. Venant 
solution. The boundary conditions for % can be derived from the 
expressions ee ee a = ee 

X, = X,cos(x,v)+X,sin(z,v) | (17) 

r r a. ? 

Y, = X,,cos(x,v)+Y,, sin(2, v) 
where X, and Y, are the given stresses acting on the prismatic 
boundary, so that 

| X,ds = W,, Y, ds = W,. (18) 
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The Michell problem is thereby reduced to the problem of finding 
the function % from the differential equation (14) and according to 
the boundary-conditions (17), i.e. to a problem of plane strain with- 
out body-forces. 

The functions ¢ and y are known for a number of cross-sections, 
e.g. the circle, the ellipse, and the rectangle; and by assuming special 
solutions of equation (14) it should not be difficult to derive expres- 
sions for the stresses and deformation of beams of such sections 
under different loadings. It might be mentioned that for such sec- 
tions, or generally sections symmetrical about the x and y axes, C, 
and ©% are equal to zero and therefore also 7,. Furthermore, for 
such sections the following expressions are also equal to zero: 


( ( ¢ dxdy, ( ( dx dxdy, ( dy dxdy, [ [ x dady, 
[ | x’ dady, ( [ xy dady, [ ( xx dady. 
This simplifies the above equations considerably. 


Appendix 
We start from the following equations: 
the equations of equilibrium 


AY (0) ( 
ORE, Oe 2 Gog = © 
Cx oy 

ay (0) ( 
: . P .. — + Y¥Y—ycosa = 0 
cy 


and the condition of sista 
o2 fo) a2 ke) 2,(0) 
ae Oe oe (20) 
dy? @a? Cady 
We introduce into equations (19) the expressions for Z{’ and Y{ from 
Zi = wed, YL) = weld, (21) 
taking e{) and e') from equations (8). 
We get accordingly 
c xT ox 2 
=a Y___y Cosa +p, , 
ex dy ox 


+240[% SX 4 Jox®+ (1—Jo 


3695.4 
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ey ax) : a 
cy Ox oy 
91 - 
tomy + 2p 4 
oY 
Now we introduce into equation (20) expressions for the stresses, 
by considering equations (5), and get 
“ co] { oD 
e2 X (0) e2zy) o2X) g2yo) E ex® 
Fs . Y panels A 
pe oxoy 
Furthermore, there follows from the first of equations (5) 
Zo) | eg — Ky X— Ky YA 2ko(x + ry?) + 2K5 (x + ay) +7, ¢]+ 
o(X+4¥), (24) 
Introducing (24) into (23), and considering equations (10), we get 
g g eq 


ax gy a a) 6 


@ = v2) 


oy? 02? Ox? oy? 


hd 4Bo(Kyx+K,y)— 
oy” Ca? 
E axe 
* Oxey 
Now we differentiate the first of equations (22) in respect to 2 and 
the second in respect to y, and get 
02X19) vig bow Oh 


‘ 9 1 ‘ ‘ 2 BT, Ds 9 
ox* Oxcy ox" 


ase met 
ox* ey" 


= o?V2( Xo +. Y) : 


ad tg o2X{0) 


oy? ° Oadoy 


- Q 


‘ 
| 
| 
| 
| 


and, if both these equations are added and again (10) kept in mind, 
this gives 


¢ 2) (0) ¢ 2y (0) ‘ 2X (9) a 
: XY - Y sae SS 4u(1 +a)(Ky% 7 KY) = 0. (27) 


| 


a9 T a, T 5.2 : 
Ox* oy" OxCYy 


axe \0) o2 2y (0) , : : 
By introducing the value for = +——¥ out of this equation into 
0x? oy? 


equation (25) we get 

22 Vv 02 i ta 

o2X (9) oz Y) 2x) Be le | | 
—— + 4 — 2 — V(X + YY) —4po(1+-0)(K,2+K,y) = 9, 
oy ox" On oy (28) 
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since p= a, (29) 
2(1-+-0) 
From equations (27) and (28) there follows 
V2 XY + YP) — oP V(X) + YP) —4(1—o?)(xaz+«,y) = 9 (30) 
i.e. V2(X9 + YW) + 4u(kga+Ky) = 0. (31) 
Finally equations (28) and (31) give 
o2X (0) e2 Y) o2X (9) 
a t+ 2 —Anolnaetecy) = 0. (82) 
oy” Ox? Oxody 
Now let X99”, Y”, and X” be particular integrals of differential 
equations (22) and (32). T hen 
xov — X¥O-_-Xe", Yor- Yo-yo", Xe" — XP—Ze" (33) 
are the stress-components of a certain state. To determine this state 
we have to consider that, because of the definition of X{”, Y(”, 
X‘”, equations (22) and (32) give 


OX” ox ae Op _ 
= — y COS a -+- pT, Dx yj+ 
OX 


¢ 7(0)" 
ox . 


Ox 


—ysin a tuns(o+2) + 
oy 


2 sitet (2+o)ay | +2 “i fOx + foy?+ 

ls igh ) Y - \ ay id 

a2 ¥ (0)! 22 yo)’ a2X (0) 
Xe Oa" 9 OX _ syalnye-+ngy) = 0 
oy? Ox? Oxody 





[f we now deduce equations (34) from equations (22) and (32) we get 
ay 0)" a ¥(0)" ayo)” ax 0)” 
oxS f seat ik ory These _ 90 
ar o oy dx x 
ioe ae . ee 
n° 4 “ a9 y a” O49 4 
PX? rr _ gAX? <_" 
oy” 0x” Oxdy 
This, however, is a problem of plane strain with no mass-forces, 
and equations (35) can therefore be satisfied through the Airy func- 
tion ¢: 
a2 o2 a2, 
xo” _ © op yo" —_ op Xo” _ oe 
fiz = ae _— a aD 
x oy” V Ox 2” y Oxoy ; (36) 


sas 
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The question therefore is only to find particular integrals of equa- 
tions (22) and (32). This can be done as follows. From equations 


(22) we have, through integration, 
(0) 
XO 4 [ oxy dx — ya sin « +p7,(¢—xy) + 2uKy{x + tox3+ 
J yy 
+(1—$o)axy?}+ 2pnx3{x’ +(1+30)a*y}+ Fy) = 0 | (37) 


= 


Px 7(0) [ 

ry + | CAN dy —yy 0080 +yr(b-+2y) + 2pnefx+ (1+ 4o)xy?} 
, Cx | 
+ 2Ques{x’ + boy? + (1—}0)2?y}+F(x) = 0} 


where F, is a function of y only and F, a function of x only. Out 
of these total integrals, particular integrals can be derived by 


assuming xo” — 9, (38) 
Vv 
If now these particular integrals are introduced into equation 
] g 
(32), this gives 
, EF dF. 
2u(2+0)(K,¢a+K,y)+—) 2— 0, (39) 


' dy * da 


From this it follows that we can assume 


Fr DP 
. w = —2p(2+o)K,y, . = = —2Qp(2+0)Ke2, (40) 
dy" dx? 
or Rly) = —$ul2+o)ey®, F(x) = —4u(2+o)eqa%, (41) 
and our particular integrals are 
XO’ — —yaesin « —p7,(d¢—ary)—2pKo{x-+ $02? + ) 
+(1—}o)xy"}— 2uxs{x’ + (14-40)(2*y—dy*)} | 
Yo’ — —yycos Erb — ey) — 2palxt (1+ doNey? + 32) — (42) 
— 2prs{x’ + oy? + Bi 


i 
XO” = 0 














A CLASS OF TRIANGLE-FUNCTIONS (II) 
By J. HODGKINSON (Ozford) 
[Received 11 July 1932] 


In my earlier paper I discussed the general form of a zeta-Fuchsian 
triangle-function, i.e. a one-valued function w which undergoes linear 
transformation when its argument wu undergoes the linear trans- 
formations associated, in the Schwarzian theory of conformal repre- 
sentation, with a curvilinear triangle of angles z/m, 7/n, 7/p, where 
m, n, p are positive integers greater than unity.* In this paper 
[ obtain explicit evaluations of w when the group of u is polyhedral 
or is associated with the representation of a rectilinear triangle. 

It must be acknowledged that the general form discussed in (I) is 
itself an explicit evaluation for the special functions considered, for 
w is expressed in terms of hypergeometric functions of a variable z,f 
together with an arbitrary one-valued function of z, and here z is 
known explicitly in terms of u. This evaluation, however, is need- 
lessly complex, and fails to display, for example, the simple algebraic 
character of zeta-Fuchsian polyhedral functions. 

[ shall use the notation u(«, 8, y), w(A,,v) to indicate that I am 
considering the determination of a function w which undergoes the 
group of transformations associated with a triangle of angles Az, pz, 
v7, when the variable w undergoes the group associated with a 
triangle of angles az, Bz, yz. 

[ find the following is the list of problems to be considered: 


(i ; all polyhedral cases; 
(ii 


(iii 
(iv 
(Vv 


(vl 


— ie 


) 
) 
) 
) 
) 
) 
) 


(vil 


* J. Hodgkinson, Quart. J. of Math. (Oxford), 3 (1932), 142-51. This paper 
will be referred to as (I). 

+ As in (I) I suppose always that the u-triangle has been mapped upon the 
half-plane of z, with the angular points corresponding to the z-points 0, 1, «, 
so that z is the fundamental function automorphic for the group of trans- 


formations of u. 
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1 1 
3° 3? 3/? 


(viii) w(4, 4, 

(ix) u(3, 4,4), WG 

(x) w(3, 4.4), w(4, | 
In this list the problem marked (a) is really included in the other 
problem with the same numeral. An equilateral triangle may be built 
up of two Schwarzian repetitions of a triangle of angles $7, }7, $7; 
an isosceles right-angled triangle similarly may be built up of two 
repetitions of a smaller isosceles right-angled triangle. The group of 
u in the problem (a) is a sub-group of the group of w in the other 
problem, and the groups of w have an exactly corresponding relation. 
The sole difference in the two problems is in the argument of the 
arbitrary one-valued function of z required in the construction of the 
general function w; this, of course, must be the fundamental function 


1 
6 
1 
4 
1 
4 


automorphic for the group of w. 

All other zeta-Fuchsian functions associated with polyhedral or 
rectilinear triangles may be included in one of the types given as 
the general arguments of (I) indicate. 

It is implicit in (I), though not clearly shown, that two groups 
may be associated with a rectilinear configuration but only one with 
the more general curvilinear configuration. The reason for this is 
obvious when the rectilinear configuration is inverted with respect 


to an arbitrary point O. 


Fia. 1 


Whereas in the general case (I, p. 147) it is possible to select that 
one of two triangles of angles A, ~—B, ~—C, whose sides occur in 
the order BC, CA, AB as the contour is described in the positive 
direction, no such choice is possible in Fig. 1 above, the plane now 
being divided into four triangles and three lunes. For such a triangle 
we use the repetition of a triangle colunar with ABC obtained by 
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inversion (reflection) in one of its sides. This is shown in the figures 
of §§ (viii), (x). From our present point of view the two groups are 
distinct, though consisting of the same substitutions in different 
orders, and this displays itself in the difference of character of the 
functions w of (viii), (x) when compared with those of (vii), (ix). 

In place of using the general form given in (I), I find it more 
convenient to revert to the initial stage of that paper, where it was 
seen that w is given by 

(w—w,)(Wy—Ws) 

(w—wsy)(w.—w,) @) 
where w,, Ws, w; are any three functions admitting the group of trans- 
formations required to be associated with w, so that each problem is 
solved by finding three such functions. I omitted to note in (I) that 
only two functions are needed when the group of w is associated with 
a rectilinear triangle. For then every transformation of w is of the 
form W = aw+b, whence it follows that (w—w,)/(w,—w,) = f(z). 
This applies to §§ (vii)—(x). 


(i) The polyhedral cases of u(«, 8, y), w(«, 8, y). 
[ have discussed these elsewhere from a somewhat different 
aspect.* The simpler method used there, i.e. by differentiation of 


the covariants of the ground-forms associated with the polyhedral 
groups, is the more suitable for our present purpose, as it enables us 
to calculate, not only three, but any number of functions having the 


required properties. 


(ii) w(}, }, 4), w(d, $, 2). 

The dihedral group associated with the triangle of angles $7, 42, $7 
is essentially the anharmonic group, and we accordingly replace w by 
A to suggest the latter group. The relation between A, w occurs 
frequently in Klein’s writings; it is given by him in the formt 
(u?—1)? 


A= — 
4u? 


A proof by the method used in (v) below is very brief. 
The unit triangle associated with the anharmonic group has angles 
of the form z/m, 7/n, 7/p. Hence functions of A admitting the same 


* J. Hodgkinson, Proc. London Math. Soc. (2), 24 (1925), 71-82 (77-8). 
+ Klein-Fricke, Vorlesungen tiber die Theorie der elliptischen Modulfunc- 
tionen, 1 (1890), 20-1, and 74-6. 
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group as A itself can be found by the method of differentiation already 
referred to. Two ground-forms associated with this group are 
N—A, Ay+A3, 2A?— 3A? A,— 3A, A3+- 203. 
From these, we obtain as our second and third functions 
A—2 A?+ 2A—2 
2\—1" 242? 2A—1 


(iii) w(t, 4, 1), w(d, 4, 2). 

The group of w is the icosahedral group but occurs in a different 
order from that associated with uw. We have to determine three 
functions w,, w,, W3, and we do this most simply from the considera- 
tion of a triangle of angles 47, 47, 27, and its associated triangles of 
angles 47, 47, 37, and $2, 32, 27. The choice of the first is not 
unique; it can be made in sixty distinct ways in the same icosahedral 
configuration; but when this choice has been made the associated 
triangles must be taken so that they are bounded by the same three 
circles taken in the same order (I, p. 147). Fig. 2 shows the 
relation of my choice to the conventional icosahedral subdivision of 
the plane. 

I find that 


F : . , 26u9°—39u5—1 
Wy = exp(si7) —, ’ W, = exp(fi7) —_,— =e 
7 +- : : u3 — 398 — 26u3 


uw? +. 119u22+ 187u7+ 170? 
17w15— 187u!9+ 119u5—1 


-exp(Sim) 


The numerators and denominators of these fractions are to be recog- 
nized as forms occurring in the researches of Klein, who points out 
that they also occur in those of Gordan, on the solution of the equa- 
tion of the fifth degree. Their connexion with conformal repre- 
sentation was shown by Fischer in a dissertation which I have not 
been able to consult in its complete form.* I describe here a method 
which obtains the three functions directly. I consider only the 
function w,, omitting the suffix for convenience. 

The triangle associated with w contains seventeen repetitions of 
the unit triangle of the configuration. If wu is subjected to the full 
group of icosahedral transformations, so that the whole plane of w is 

* F. Klein, Math. Annalen, 12 (1877), 502-60; Vorlesungen iiber das Ikosa- 
eder (1884), 189, 196, Klein-Fricke, Vorlesungen iiber die Theorie der ellipti- 
schen Modulfunctionen, 2 (1892), 385-6; P. Gordan, Math. Annalen (1878), 
375-404; O. Fischer, Leipziger Berichte, 36 (1884), 17-31. 











The darkened lines show the boundaries of the triangles; the figures below show 


which are the three triangles chosen and the positive sense of each contour. 
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covered completely by the repetitions of the unit figure, the plane 
of w is covered seventeen times. Hence the relation between w, w is 
of the seventeenth degree in wu. 

An angle 47 at the origin in the u-figure is transformed into an 
angle 27 at the origin in the w-figure. Hence u = 0 is a double zero 
of w. The transformation w’ = wexp(iiz) corresponds to the trans- 
formation u’ = wexp(2t7). Hence 

w= uF (u*). 
As u moves from the origin along the side of the initial u-triangle 
which coincides with the real axis, argw = $7, whence 
gs \ Ag tt? +0, Ul? +a, U" +a, U? 
: exp(t*7) by u5+-b, u+-b, u> +b, 
and a,/b, is positive. 

As u tends to infinity, w tends to infinity (for aj cannot vanish if the 
required degree of the relation is to be preserved). Thus there must 
be an icosahedral transformation of the form w’ = —exp(zkiz)/w 
corresponding to the transformation u’ = —1/u. If, then, the zeros 
of w are given by u® = a°,f*,y°, the poles of w are given by 
u® = —1/a5, —1/8°, —1/y*. But by considering a suitable succession 
of repetitions of the initial w-triangle and the corresponding repeti- 
tions of the initial w-triangle, it is found that w has three negative 
zeros (shown in Fig. 3 in the 4th, 5th, 6th repetitions), and that, as w 
tends to infinity in the negative direction, w tends to infinity in the 
direction arg w = 47; hence, if w’ -1l/u, w’ = —1/w. 

From these considerations we can write 

= See 2 
re —exp(tin) BE oa a 
vu>— pul? rur—1 
where A, p, v are positive. 

Now the effect of applying to w the transformation which Klein 

denotes by 7'(w) is to apply to w a transformation 7"(w). T'(u) is 


equivalent to successive inversions in those sides of the initial 
u-triangle which contain the right angle; similarly for 7’(w). T'(u) 
transforms 0 into a, « into 0, 00 into —1/a, —1/a into oo (three of 
these suffice to determine 7') where « = }(v5—1);* i.e. 


—U-+-a 

T(u) = i 
aw-+ 1 

* The conventional scales are used for the polyhedral configurations 


throughout the paper. 
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T’(w) transforms 0 into exp(}iz)/a, exp(liz)/« into 0, 00 into 
aexp(Sizr), and aexp(Siz) into 00; ice. 


. ,—awtexp(ti 
T"(w) = exp(tiz) eee 


A 


The figure shows the initial w-triangle and six repetitions; the seven triangles are 
supposed numbered from the right for comparison with the w-figure below.* 





SHR 


Fie. 3 


[f now we replace u in the expression for w by 7'(w), we obtain 7”(w), 
and we find, after reduction, that we have to determine A, p, v so 
that the equation 


(u—«a)!7—A(u—a)**(aw-+ 1)®+- p(u—a)? (a+ 1) —v(w—a)?(axw-+- 1) 


v(u—cx)*(au+ 1)? po(u— ox)! (actu 1)? -A(u— ex) (xe 1)!?+- (au 1)"7 
— au?(wl>+ Aul+-pwud+-v)+ (vul®§—pul?+dAu—1) 





u?(ut> + Awl+- wudtv)+a(vul®—pwul?+Aue—1) 
is an identity. 
* It is impossible to put the seven w-triangles into one figure on account of 


overlapping. The size and orientation of the triangles enables their position 
in the icosahedral configuration to be determined. 
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Clearly «= « must be a double zero of the numerator of the 
right-hand member of this equation. Hence 
A(a5—ot8)—pu(a8+ ao!) —v(a3—a!5) = 1+ a8, 
and A(5ae2— 12a!) — (705+ 10a7)—v(2—15a!) = 17a}, 
After substitution of the value of «, we find that 
3A—p— 8v = 34 
and 
\(655v5— 1461) —p(165vV5—367)—v(2160V5—4826) 
= 34(305v5—682). 
Further, the coefficient of u'4 in the numerator of the left-hand 
member must vanish; i.e. 
2A(1 — 1202+ 33a04— 2208) —u(2205— 63a? + 4209 —7ol)+ 
+ y(91o01°—42q12+4 314) = —136a, 
or (89V5—119)A—(53V5—118)u—(44v5—99)v —68(v5—1). 
Hence we find that A = 119, uw 187, v = 17, so that 
wt? +. 1192+ 187u7 + 17u? 
17u5— 187u+-119u5—1 © 


—exp(Si7) 


The derivatives of w,, w., ws contain as factors the icosahedral forms 
w+ Viw5—1, (wi9+-1lw5—1)?, and w?°—228u!5+ 494u!0+ 228u5+- 1. 
It is easy to see from the geometry of the configuration that this 
must be so, and this gives a check on the working. 


9 


iv a. 3 fl) 3 Ae 2. S a ae 
(iv) w(2, 3, §)> W(2, 3, 3) (IVE) U3, 3, 3), WUE B 3)- 
The fundamental functions associated with the u-triangles were 
obtained by Love,* who integrated the corresponding Schwarz- 
Christoffel equations. He gave the mapping of the triangle of angles 
la, 47, do upon a Z-half-plane in the form 
o 1— @3(u 
g— *— “| ) 
1+ 07(u) 
where g’*(u) = 4{3(w)— 1}. 
These special elliptic functions have a real half-period w, and two 
complex half-periods w., ws, where 
% ~ 1 
1 P(3)P() 
6 a) ’ 


e denoting exp(%iz). 


Wy Ws 


* A. E. H. Love, American J. of Math. 11 (1889), 158-71. My notation 
does not agree with that of Love. 
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The angular points A, B, C of the w-triangle are w,, 4w,+3wo, 0, 

corresponding to the Z-points 0, 1, —1. As, however, we have it 

always in mind that the mapping of a triangle is associated with the 

hypergeometric equation, we transform the Z-points 0, 1, —1 into 
the z-points 0, 1, 00, so that 

2Z 

Ds oe 

The w-triangle of angles $7, 47. 47 is the unit triangle of the 

tetrahedral configuration and six repetitions build up a quadrant of 

the w-plane. Six corresponding repetitions of the wu-triangle build 
up the rectangle whose vertices are 0, w,, 2w,-++2w., w,+2ws. 


1— 3(u). 


X 


wi - Figure u- Figure 
Fic. 4 

The w-figure shows the configuration in Klein’s second position of the tetrahedron 
relative to the axes of coordinates. 

The quadrant is transformed into a half-plane by the relation 
w? = f. With the conventional dimensions for the w-figure, C is the 
point 7(2—~3)!, so that € = —(2—~v3). 

Now consider the relation Z = g(u;Q,,Q,) where Q, is real, 
Q,+0,+Q, = 0, R(Qs) = 0, and J(Q;) is negative. This relation 
maps the rectangle whose vertices are 0, Q;, Q,+03, Q, upon the 
half-plane of the variable Z. Hence, to map the rectangle whose 
vertices are —Q,, 0, Q,, Q,—Q;, we have the relation 

Z = E(U+Q3;Q,,.Q), 
i.e. a linear transformation of @(u;Q,,Q,). To bring the points C, 
A, Q into correspondence with three assigned points on the real axis 
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we must apply a further linear transformation to Z. Now, in our 
figure, Q, = w,, Q, = —(w,+2w,.), and therefore Q, = 2w,. Hence, 
writing (wv; w,, 2w.) as ,(u), we have 
a af),(u)+B 
yP(u)4-8° 
where a, 8, y, 5 are to be determined. 
We require that £ = 0 when u = o,, ¢ is infinite when u = w,+2w, 
and ¢ = —(2—~v3) when wu = 0; i.e. 
c= —(2—N3), (P1(U)— My oul 
{P1(U)— Py (w+ 2 
But —(u; w,, ~~ a( du 
a(3u)+ —(du+a,)— (a )}, 
where the elliptic functions in the last expression are formed with 


L; ka, Ws) 


periods 2w,, 2w.. 
Hence ee. + (su Foo) 20(ber)} 
{P(s’ & (4 -++-w)—29(}w,+9)} 
But —(4,) 3+1, e( : —*3-+1, and hence 
¢(3u)—vV¥3—1}" 


¢(su)+V¥3—1}?’ 


i.e. 


( 

( 
) P( iu) v3—1 
@(4u)+v3—1’ 


the sign of w being chosen to make /(w) positive, when wu is a small 
positive number. 

The tetrahedral triangle has angles $7, 7, 47. We find two other 
functions admitting the same group by differentiation of the ground- 
forms w+ 2V3w? w3—w. Our second and third functions may thus be 

—~v/3w?+ 1 V3w?-+ 1 
ur+tv/3w ’ w>—/3w- 


(Vv) (3, 3, d), W(d; 2 §)- 

As in (ii) above, we may replace the dihedral group, w($, $, 4), by 
the anharmonic group. 

Six repetitions of the unit triangle of the A-configuration build up 
the A-half-plane, and the corresponding repetitions of the u-triangle 
build up an equilateral triangle. 

Love (l.c.) gives the representation of an equilateral triangle by 


21/9’ (uy) 


means of the formula Z- 
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with the same special elliptic functions as in (iv) above. In his work 
the vertices of the equilateral triangle are 0, 4w,+-3w», 3w,+$w, in 
the plane of u,, and these correspond to the Z-points 0, 1, —1. In 


es A 
A- Figure 








A 


a ee u- Figure 


Fic. 


our u-figure the vertices are 0, 2w,, 2w,-++-2w,, and the corresponding 
A-points are 0, 1, 00. Hence we write 


A= a and u = V3u,exp(—ii = v3u, €? 
=7,, ™ 4 = VdUu, exp(— fim) = iv3u, e, 


1; -+- 


47 4i , , 
——, —.—_—— = ——_,_____., for 9’ (eu) = @’(u). 
21+ —'(—tWe/v3)  21—¢"(tu/v3) 


We take the same second and third functions as in (ii) above. 


(vi) u(d, $, ), w(d, 4, 2); (via) u(, 4, 4), w(4, 3, 4). 

The fact has already been remarked that an isosceles right-angled 
triangle may be built up of two repetitions of a smaller isosceles 
right-angled triangle. For this reason the fundamental function of the 
triangle may be presented in an endless variety of forms, connected 
by transformations of the special elliptic functions. I find it convenient 
not to use the form given by Love (l.c.) but to obtain another. 
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It is found by means of the Schwarz-Christoffel formula that a 
square in the u-plane is mapped on a Z-half-plane by the formula 


Z = Q(u), 


where g’*(u) = 4{93(uw)—e(u)}. 





A B A 
u-Figure 


Fic. 6 


These elliptic functions have a real half-period w,, and an imaginary 
half-period w,, where* 
= , D(z) P(2) and ws = —ta. 
4 V(%) 

If now the w-point describes the contour of the square whose 
vertices are —2w 3, —w 3, w;—W 3, W,;—2ws, the Z-point describes the 
real axis from —oo to oo, the points —oo, —1, 0, 1 corresponding 
to the vertices A,, B,, C, D of the u-figure (Fig. 6). 

By employing Burnside’s method} for figures built up of repeti- 
tions of another figure, we find that the triangle A, B, C is mapped 
on a z-half-plane by the formula z = 1/g?(u), the z-points which 


* I take —iw, in order to comply with the conventions I(w,/w,) positive 


and w,+wW.+w, = 0. 
+ W. Burnside, Proc. London Math. Soc. (1) 24 (1893), 187-206 (194-7). 
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correspond to the vertices being 0, 1, 00. But the triangle ABC is 
a repetition of the triangle A, B,C. Hence 1/ (uw) is the funda- 
mental function for this triangle also. 

Now eight repetitions of the w-triangle build up a half-plane, and 
the corresponding repetitions of the u-figure build up the square 
whose vertices A, A’, A”, A, are 0, 2w,, 2w,—2w3, —2w,. In the 
w-figure A, A’, A”, A, are 0, 1, 00, —1. The relation between w, wu is 
obviously a transformation of that between Z, u. In fact, we have 
merely to replace wu by }u—wg, and apply a linear transformation to 
Z to obtain w, so that 


where a, 8, y, 5 are to be determined. 
Now «a = 0 since w = 0, when u = 0; 85 = 0 since w is infinite 
when u = 2w,+2w,; B = y since w = 1 when u = 2a), 


1 
(3) 
Two other functions are obtained by the method of differentiation 
from the fundamental forms w!+-wi, w,w, associated with a triangle 
of angles 47, 47, }v. They are —1/w* and —w. 


(vii) (4, }, 4), wd, 4, 4); (vida) w(h, 4, 4), wld, 4, D- 
We note first that, the groups of u, w being identical, w itself will 
suffice for one of the two functions we seek. The figure shows a 
generalized triangle which is associated with the same group. 
The representation of this w-figure is given by the formula 


dw , 2# 
dz (z—1) 


where K is a constant to be determined by the scale of the figure, 
the z-points 0, 1, co corresponding to A, B, C. (This is not quite the 
ordinary Schwarz-Christoffel formula, but a slight modification which 
applies when one of the angular points of the rectilinear figure is at 
infinity.) Writing, from (iv) above, z = 1—g*(u), and adjusting the 
constant to convenience, we find that 


a 9" (u) 
w= see +8. 


3695-4 D 
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But when u = 0, w = 0, and therefore 8 = 0, and when wu = a, 
w = w,, and therefore a = }. 

The two functions we require are 








1 Q’ 
. je QP (u) 
6 —7(u) 
c 
A 
w - Figure 
By 
Fic. 7 


(viii) w(4, 4, 4), w(4, %, 4); (viiia) w(4, 4, 4), w(%, §, 4). 
The figures (Fig. 8) show two generalized triangles associated with 
the required group. Their representations are given by 








dw 1 dw, i 
to Ps i es 
dz zt(z—1)*” dz “(z—1)! 
on i 
we ' = 
a | a | 
Pid lA ii iA 
i eee, wa 
8 
w, Figure w ,- Figure 
€ Cc 
Fic. 8 


Again, writing z = as we get 
wy = al(u)+B, Wy = ¥{C(w)+$O*(u)M'(w)}+8. 


It is easy to see e the transformation W = —w corresponds to 
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the transformation U = —u. Hence f, 8 both vanish; also, when 
U = wW,, W = w; i.e. 

w,=<*C(u), wy = “*{L(w)+4.9*(u)@'(w)}. 
1 el 
Since C(ew) = e*f(u), n. = €2y,, and hence, from the formula 
1 @2—N2@, = tn, we find that 7, w, = 7/2V3, so that the constant 


n, is known. 


(ix) u(4, }, 3), w(4, 4, 4). 
As in (vii), wv itself will supply one of the two functions we seek and 


the figure shows a generalized triangle associated with the same group. 


Cc 





w- Figure 


Fic. 9 


The formula for its representation is 


dz z 


dw _ Ke 4 


Writing, from (vi) above, z = 1/¢@?(u), we find that 
, 
w = af4u—'(u)}+-B. 
Since the transformation W = —w corresponds to the transforma- 
tion U = —u, then B = 0; since w = w, when u = w,, a = }. 
The two functions required are 


U, u—}e'(u). 


(x) w(}, $ B), wld, b 8). 
The figure shows two generalized triangles associated with the 
required group. Their representations are given by 
9 4 9 a » 3 
du, __—s=—sOCK,«x dw, _ K,(z—1)! 
=: ; = . 
dz zt(z—1)! dz zt 
D2 
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Writing z = 1/@?(u) again, we find that 
w, = al(u)+B, Ww, = y{C(u)+4E(u)e’(u)}+6. 








Since the transformation W = —w corresponds to the transformation 

U = —u, B, 6 both vanish; since w = w, when u = w,, a = y = w,/7). 
A | 

/ ae | ; 
ila 4 i 
re i 
Pee “4 1B 
1 
! 
C Cc 
i 
1 
w; Figure w,- Figure 
A A 
Fic. 10 


The two functions required are 
pad! pee 
17m), 14(u) +-20(w)g'(w)}. 
M1 11 
Now for these elliptic functions {(—iw) = i{(w), and therefore 
3 = %y,, and hence, from the formula »;w,—7,w3 = }t7, we find 





that y,#, = }2, so that the constant 7, is known. 
We note that the general formulae in §§ (vii)—(x) take specially 
simple forms. They are, after obvious simplifications, 


(vii) w = w+ @(u)e'(u) F{@*(w)}. 
(viii) w = “*Z(u) + @*(u)@'(w) F{O%(w)}. 
(ix) w = +6") F{p*(u)} 

(x) w = “2¢(u)-+ Ow)’ (u) F{O%(u)}. 


i 








THE REPRESENTATION OF A NUMBER AS A SUM 
OF FIVE OR MORE SQUARES 


By E. MAITLAND WRIGHT (Ozford) 
[Received 5 July 1932] 


Introduction 

|. THe problem considered here is that of representing a number n 
as a sum of squares ‘almost, proportional’ to assigned positive 
numbers. If 0<B<1, 
and if Ags Agysos5 Ay 
are any assigned positive numbers whose sum is unity, is it possible 
to express every positive integer » in the form 

n m?—+-m3-+-...+-m?2, (1.01) 
where |m?—A,;n| = O(n}-*), (1.02) 
and what is the largest value of 8 for which this is true? In this 
paper I shall show that such a representation is always possible 
provided that 


=e nt ak a _~fe—4 ‘s-—3 } 
5, p< = oe) = a’ 2(2s—1)}" (1.03) 


On the other hand, such a representation is not possible for all 
n if B > 4. Let [v] denote the greatest integer less than or equal 
to v. Let us take r any positive integer and n such that 
[A;n] = r?-+r. 
Then, 
A;n > 2, [A;n] > JA; n, r > r?+r) = An] > Ayn. 
\lso, if m? is the nearest square to A;n, 
\m?—A,;n| > r > 4 ni. 
If we let r run through all the positive integers, we obtain an infinite 
sequence of values of » for which the last result is true. But (1.02) 
cannot then be true. 
The question remains unanswered if f lies in the closed interval 
(8) <B<h. 
Since a(s) < }, this gap is considerable. 
My method is based on the Vinogradoff method* for the solution 


* M. Gelbke, Math. Annalen, 105 (1931), 637--52, gives this method in its 
latest and simplest form. 
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of Waring’s problem. Incidentally, I find an asymptotic formula for 
the number of solutions of (1.01) satisfying the conditions 
\,;n—A;ni-B < m3? < A,n+A;,n!-F (-=' 1,3 

where A,, Ag,..., A, Mi; Tet Be 
are any assigned positive numbers. 

I hope later to discuss the same problem. for powers higher than 
the second, and for 3 and 4 squares. In the latter case, certain less 


complete results are obtainable. 


Notation 
2. In what follows s, i, n, h, j are positive integers, 1 
8 

and we write [| for []. Also 

i=1 
B, X;, Aj, A; (¢ = 1,32....,8) 
are any assigned positive numbers subject only to the conditions that 

0< 8 <— i, A AA+... +A, = 1. 
Later a more restricted condition will be imposed on f. We write 
s—4 


5<s< 8), 
{s—4 es | 3s—4 ( - = 


x(S) a I 2(2s—1) = 
Y = y(s, B) : ts—1—(s—1)B, 


s 


A > As A’ =¥ Ai, 


1)=2" (2>8); r(7z)=90 (2 < 
2 = P(A... Mig Bape eed 
(A'\— > 1(A’—A,—At)+ 


1<17S5 


a ee ae a a, ae 
1<i<i’<s 1<i<i’<i’<s 


We use « to denote a positive number, to be thought of as small; 
its choice will be subsequent to that of s and 8. C and ¢ are positive 
numbers, whose values vary from one occurrence to another. C de- 


pends at most on 

Bh gyonry Mii Annies tas 
c depends on s and 8 only. The constant implied in the O( ) notation 
is of the type C. 
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The relation 
ra) +0 
Qi—2)"* =F d= > 4 (|z| < 1), 


m=0 “n=— 


defines d,, = d(m,s), so that d,, is zero if m is negative. 
We write ; 
p =exp(2mip/q) ((p.q)=1), 8, =>", 
AQ5) = ~* 2 Spe’ 
where p runs es all oe qth roots of unity; 
A,(q.j); 


“Ben 
a 


s(7,J)- 
N,; = [A;n+A;n! : y = [A,;n—Ajn'-8+ 1], 


S ( 
) 


L=n— s Ni, = A’n'-#+0(1), 
iF 


M, = N,—Nj+1 = (A;+A4j)n'-?+0(1). 
We define r(n) as the number of solutions of 
n m?+-m3-+-...+-m§ 


such that 
m,;, > 0, A;n—Ajn' B<m?<A,n+A,n!-8 (¢ = 1,2....,8 


or, What is the same thing, 
Ni < m? < N; (6 = 1,3....,@); 
v(n) is the number of solutions of the equation 
n= jrtiet--tde 
which satisfy the conditions 
N; < ji < N; 


We write fil) =_ > 2”, 
Ni<IE<N; 


N; x 7 
fp, j(X) = > J” i J. 
Then r(n) and v(m) are the coeflicients of x” in 
Ni ‘ 
Tf), TI{ >} 
J=NG 
respectively. We define (7) as the coefficient of 2” in 


II {1 :(”)}. 
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Then the coefficient of a” in 
IT t,,<(2)} 
is q*Spp-"e(n). 
We take the Farey dissection of order 
1 = [ni0-P)] 
on the circle & (|x| = 1). Then associated with every point p on & 


for which q < nap 
we have an are It. For each p we write 
© = petty, 
Then we know that y| < (ql) 


y| <. 
on the are I associated with p; while every point for which 
y| < (2ql)7 
certainly lies on this Mt. Each arc M is divided as follows. The part 
for which ly| < q-in-t-# 


is denoted by MW’, the remainder, if any, by M”. 


The values of y(n) and v(x) 
1. We first find the asymptotic value of v(n), and from this 
deduce that of p(n). 
Lemma 1. If s > 2, then 
d,, = {7(m)/T(s)}+ O(ms-*) (m > 
d,= 1, 


d,, = t(m)/T(s) (m < 0). 


_ 8(1+8)...(m+s—1) — (m+1)...(m+s—1) 
y a le 


= {ms—1/I\(s)}4+- O(ms-*) = {r(m)/T'(s)}+ O(ms-). 
If m < 0, d,, = 0 = r(m)/T(). 


m 
3.2. Lemma 2. If s > 2, then 
Tn(s-Da B)| 
n ; 
v(n) — - < Ons-20- B). 
re) | 


We see that v(x) is the coefficient of x” in 


n—L =_— aM 
(>) = | x ins oe “ 


Ay; 
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and is therefore the coefficient of x” in 
TI Q—2™) — 1—a a — a, it Mat... 
(1—a)8 (1—2)* 
But this is obviously 


dy — > dp-u,+ x dy_u,—-M.—-- 
Sst SS 


1<i<s 
“= a [(L)— & L—-M)+ > (L—M,—M,)—...) +O). 
Then, if we substitute for L, M;, we have 
Bit PB ' ’ = s—2\1— 
v(n) = —~ 1(A’)— > 1(A’—A,;—A;)+...} + O(ne--B) 
T(s) | i<ixs J 
Tn's-a-B) 
ee 
3.3. Lemma 3. If s > 2, then T > C. 
We see from the definition that 7' is independent of n. Then, by 
Lemma 2, it is sufficient to show that 
v(n) > Cn-2A-) 
for all large n. Now v(n) is the number of solutions of 
L = h,+h.+...+h, 
which satisfy the conditions 
0<h, << M,—1 (6 an 1, 3....,8). 
Let B be a positive number such that 
8B < min(A,, Ag,..., A,, Aj, A3,-.-, Ag). 
Let us choose h; (1 < « < s—1) so that 


= 


(A;— B)n1-8 < h; < (Aj+ B)n'-8, (3.33) 


S 


+ O(n'’-20- B)). 


We can choose h; in [2Bn!-*] ways so as to satisfy this condition. 
Hence, there are [2Bn!-*}*- sets of numbers 


hy, hg,..-y hy 

which satisfy (3.33). But these numbers also satisfy (3.32), since 

0 < (Aj;—B)n!-* < (Aj+ B)n!-8 < M;—1, 
if nm > C. Also, we can take 

h, = L—h,—h,—...—h,_1, 

and we shall have, for sufficiently large n, 

h, > {A;—(s—1) Bjn- 4+. 0(1) > 0, 

h, < {Aj+(s—1)B}n1-8+ 0(1) < M,—1. 
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Then to each of the sets satisfying (3.33) corresponds at least one 

solution of (3.31) satisfying (3.32). That is, for all sufficiently large n, 
v(n )> > [2Bn! Fp- 11> Ons-a- B). 


3.4. Lemma 4. If s > 2, 
. | 
p(n) — —_—__- ‘a < Cny-B4+ Cny-1+8, 
2s1(s)(T J A;)! 
We have 
. n) = 3° 5 (TT 37"), x(n) = > 1, 

the ii tiene pee over all sets of integers j,,Jo,...,jJ, Satis- 
fying (2.01) and (2.02). For such values of j;, 


1 <14< 11%, 


Now 
TI 4: = TT [A;n—Ajn?-8+ 1] = n(TT A;){1+ 0(n-)}, 
TITY; = IT[A; ae n*(T] A;){1+ O(n-8)}. 
Hence, TL 3c? = nT] A;) {1+ O(n-F )}, 
and so ss "tie 14-00) 
Git 


ITA) yt + O(n-8)+ O(n-42-P)}, 
by Lemma 2. 
We see from this lemma that 


p(n) < Cnv. 


Known results 
1. The results in this section are not new, but for convenience 
we give them here in the form in which we shall require them. 


Lemma 5. If m and r are integers, m being positive, then 


r-m 
pl| < Cqem*(mq-*?+¢'). 


h=r+1 








It is known that* 


r-m 





9 
p™| < Cq*m*(m+-m?q-!+-q) < Cq*m*(mq-*+q?*)?, 
h=r+1 
and the result follows at once. 

Lemma 6. We have |S,| < Catt. 
This is Lemma 5 with r = 0, m = q. 


* Landau, Math. Zeitschrift, 12 (1922), 226, Hilfsatz 2 
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4.2. LEMMA 7. If s > 5, then the series for S(n) is convergent, 
|S(n)| < C, 
and |S(n)—S(n, n#4-P)| < Cn-t0-P, 
In Lemma 6, take e< }-—s"", 
Then we have 
|A,(q,)| < g~* > |S,|* < Cg" < Cq-¥. 
p 
Hence > |A,(¢.)| < C > q-* < CAl-#, 
q> q>a 
This shows that the series for G(n) is convergent. If we put A = }, 
we have S(n)| < C; 
and if we put A= nit-P), 
we have 
|S(n)—S(n, nt4-P)| < Cnia-1s1-B) < Cn-*a-P), 
1.3. Lemma 8. If s > 5 and n is sufficiently large, then 
S(n) > C. (4.31) 
If s >5, the number of representations of n as the sum of s 
er re {1'(}s)}-'!*nls-1S(n) + O(n'*-1~), (4.32) 


where c = c(s) > 0. Again, if s = 5, we havet 


bie n\ 1 
S(n) > C > (=) 


where m runs through all odd numbers prime to » and ( is the 
m 


)-+ 


n 


Legendre symbol. Since 


we have, when s = 5, 
ie 8) 
S(n) > C(1 —< 1 C(2—1n) > C 
O(n) > — Fe = O(2—% : 
(2t+1) , 
t=1 
* No proof of this for general s appears to have been published, but one 
can readily be constructed by the Hardy-Littlewood method for the solution 
of Waring’s problem or by the Vinogradoff method. Hardy (Trans. American 
Vath. Soc., 21 (1920), 255-84) has proved that the number of representations 
of m as the sum of s squares is exactly 
{T'(4s)} tnt S(n), 
if3<ec 8. 
+ Hardy, loc. cit. 
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Ostrowski* has shown that, if (4.31) is true for s = 5 and (4.32) is 


true for s > 5, then (4.31) is true for s >5. This completes the 
proof of the lemma. 


4.4. Lemma 9. We hans 


asl 
'm=1 ! 
When m 
2m? — 2(m—1)* = m+ O(m-), 
and, by summation, 


273 > m-*- > O(m-*) -% m+0(1 


m=1 m m 


Behaviour of f;(x) and y%,,;(x) on & 
5.1. Lemma 10. We have 
p,,(")| << Cn-tg-h+¢ min(n!-B, |y|-1). 
We have 
Ni; = [A,;n—Ajnt-8+ 1] = A;n+O(n!-*) > Cn, 
if m is greater than a certain », where m, is a number of the type C. 
Hence, in this case, 


Again, if we write 


w(j) 7 z e27imy 


m=N; 


w(N;—1) 


then 


and 


Hence, 


8,| (% 

Ib, «(x) | ie | r 
q\y| i 
< Cqe + ly|(Ni)+t < Cg tent ]y|> 


* Math. Zeitschrift, 9 (1921), 28-34. 
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On the other hand, if n < mp, then 
eek yli<3=C, 
lf, «(2) 4(N;—Nj+1) < Cn < Cn, = C,7 
and the lemma is He ae true. 
5.2. Lemma 11. We have 
| f(x) —,,e(x)| < Cq'+*max(1, n'-8 ly). 


Let us write 


ajj=— > »”, oj) = 3¢78, > m+. 
1<hF<j 


<h?<j 1<m<j 


Then u(j) — > p+ y pl. 


1<h<ati'/q) ati*/a<h<{3*) 
The second sum has less than g terms and so, by Lemma 5, is less 
in absolute value than Cq!**. The value of the first sum is obviously 


S[7*/q]. 
Hence \w(7)—[ j*/q]S,| < Cq'*«. 
Also, ILz*/a|S,— (7? /Q)S,| wa \S,| < Cqit. 
By Lemma 9, ln j)—(J#/)S,| < CS, /q| < C. 
Hence, |u(j)—v(j)| < Cqtt. 
Now we have 


Then, finally, we have 


fale) c(@) | Siw 
Ni 


+ lu u(Nj—1)—v(Nj—1)|+ |u(N)—o(N)| 


< Og<[1—Z(N,—N-+1] < Cgh<(ntFly| +1), 


5.3. Lemma 12. If |y| < n8-, then 
TH fie) TI ope] < Cate ah +-n8-PH-2y- 10-2, 
If nP-1 < |y| <4, then 
[TT fe2)— TT ¥p.i(a)| < Cqs<(qinet—P)|y|2+-g-+nie—9-B ly |2-*). 
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If s = 1, the lemma follows at once from Lemma 11. If s > 1, 
let us write f= Ups +¥, i 
Then we have 
Il f- Il Pp _ Il (bi+ Ha— Il Bois 
a sum of 2°—1 terms. Each term consists of s factors, not more than 
s—1 of which are of the form ¢%,;. By Lemmas 10 and 11, a term 
which contains r factors of the form ¥, ; and s—r of the form ‘¥, ; is 
less in absolute value than 
Cn-¥qis+8€ min(n"4-P), |y|-")max{(n!-F |y|)s—, 1}. (5.31) 
When |y| < n®-1, this is less than 
Cnre- Bigs dies 


and hence 
s—1 


[TT file) — TT 4,2(@)] < Cates > (nt-Pqoty 
< Cq**(q#8-+-n4-PXs—Dg-Hs-2)), 
When |y| > »®-1, (5.31) is less than 
O(g* + |y|n*-P)*(q-1 |y| 2-4 +8), 
and hence 
[TT f:e)— TT se) < C(gt**|y|n*-P){1 + (gly | PnP} 
< Cq*(qisn84- B) y|>+q-# +1 y |2-8nt8-s)-B), 
5.4. Lemma 13. On M’, 
ae | filee)| < Cne(qly|)-6 
We write, _ ni-B, — exp(2zi{(p/q)+y}); 
1 ] 
—< yl < : 
qim’ ~ , q|m* | 
qiy| < 2. 
Then, by the theory of the Farey series, we know that there are two 
integers p,, g, and a number y, such that 


and we have (5.41) 


) ? 
Pris y, =Piy, 
1 qd 


9 
0<a<4a< 


q\y|’ 


First suppose that 
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Then ly| ly| 
iy,| => 


P= Pp q= %, y= oe ’ 
y ; . % 24, 


which contradicts (5.42). Hence, 
Pp = I Pr 
Pf, 
7 "2 
Again, suppose that Gy < [m'*]. 
Then there is an are Mt, corresponding to p,/q, in our original Farey 
dissection of the circumference &. Since, by (5.41) and (5.42), 
qiy|_ 1 
24, 2q,|m'* |] 
x lies within this are Mt, (that is, not at an end-point). But this is 
impossible, for x lies on the are M. Hence 
q, > [m'*], 
and so q, > mt = nit-P), 


"| < 


We now write 
p= exp2ripiay) (9) = Sp (Mi <5 <M). 
Then u,(j) contains at most 
N}—Ni#+1 < Cni-8 
terms, and we have 
iuy(J)| < Caenet-P(nt-Bqy4-+-gh) < Cqhsnee-P, 
since q, > nta-B) > ni-B, 


Hence, 


From this we have finally 
Ni-1 
file)| <|I—=] > Jeey(5)|+ lee) 


A j=Ni 


<¢ negt( 1—=|N,—¥,)+1] 
Pi 
< Ontq}+«(n!-B\y,|+1) < Onsg}*«(n'-Pg;?+ 1) 


< Oneq}** < Cn*(q\y|)-*~. 
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Final lemmas 
6.1. So far 8 has denoted any positive number less than unity. 
We now introduce the restriction 
0o< B <a 
and write 6=a—fB> 0. 
Finally, we choose « so that 
e < 8/(2s+1). 
Lemma 14. Jf s >5,0< B < a, and 
E, = nB-1 ¥ E, = n*e-%) Bis—2) '¥ g—ie-2), 


yw wm’ 


E, = nie-2- B) ps q-te+4), E,= nis—4)—i(s—DB D gts), 


a 
we have E, < Carte ¢: = %.2.5, 4), 
There are at most g arcs M’ associated with a particular g. Hence, 
qd 


if w is any real number depending on s and f only, we have, when 
ee ae 
oO as Yq < > qett < Cn+20-B), 
MW’ q<nt—s) 
and, when w < —2, 


Yqe< Hh< FY gq'i< Clogn < Cn‘. 


on] 
wi’ q<ni(1—B) q<ni(—s) 
Using these two results, we find that 
E, < Cni#a-P) 
E, < Cni6 138) 
Dees Onis-3) B(s—2)+e 
E, < Cn0-8) 


V 


I 


a < Cn 3s—21-B)+€ 
E, < Cniea 38) — Cnts-B) 
Hence it is sufficient to prove the inequalities 
$8(1—B) < y—28 


VVAWV 


\V 
ou 


ca —— as ww 
7 oe 


| 


co 


4 
1(5 < y—28 
}(s—3)—P(s—2) < y—28 
}(s—2)(1—B) < y—28 
To prove the first we have 
y—48(1—B) = H{(s—4)—B(3s—4)} 
= }{(s—4)—a(3s—4)}+ }8(3s—4) > 46 > 28, 


V WV 
@ 


since, by definition, (3s—4)a < (s—4). 
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For the second, when s = 5, we have 
y = 3—48, B = a—, 8<a=j, 
y—4(5—138) = }(1—38) = (1—3a)+ 35 = 2a+35 > 118 > 28. 
For the third, 
y—Hs—3)+B(s—2) = JB = }—a +8 > a8 > 28, 


since « < }. For the fourth, when s > 8, we have 


y—}(s—2)(1—B) = 3{(s—2)—2B(2s—1)} = 438(2s—1) > 58 > 28. 
3.2. Lemma 15. If s > 5 and 0 < B < a, then 
I, = > } {TL f@)— IT 4,4(@)| {dx | < Cnr, 
By Lemma 12, we have 


| | II f(x)— Il $,,i(x)| \da| 
gs nf-l 
Cor (a+ ne BXs—Ng -i(s—2)) | dy +. 


0 


q-in-i(1-3) 


+qisnst B) | ys dy +- q-*+1ni@-s)-B [ y2-s ay} 


n8-1 n-1 
< Ons<(nB-lq)s + nis—3)—Bis—2) —Hs—2)_1 Xs—2X1—-B)g—Xs+4)) 
since qs < n™. 
We now sum over all the arcs It’, and we find 
I, < Cn*(E,+E,+E,) < Cny-®1e+0 < Cnr, 


6.3. Lemma 16. If s > 5 and 0 < B < a, then 
I, => | |TD faa) |dx| < Cnr. 
W 
mn” 


By Lemma 13, we have 

| TI f:(2)| da| < On<q-'s-8€ | ly |-#8-8€ dy 
; a in-K1-8) 

wie COns<q-*#-8€(qinta-B)ts+8e—-1 < Cin2s¢+4.—-BXs—2)9g—He+4) 
We see that every M” is associated with an M’ with the same value 
of g. Hence, 

| Pe On2se+i1—-Bxs—2) $° g—Ho+4) 

Ty 


= On*E, < Cnyv-B+2s+e — Cny-, 
E 
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6.4. Lemma 17. If s > 5 and 0 < B < a, then 


Is z J [TT Yo] fda < Cnr 





We have, by Linea = 
pp (a)| < Cn-tg-t+*|y|~. 
Hence | ITI Ppi(X )| )} |dar| a C: n- isq—ts+se ( or dy 


RW’ q-tn-#(1—B) 
= Cq*«n-*q -48(gin§—B))s—1 
< Onse+ts—4)- i(s—1)Bgk(s—4), | 





and so I, < On*E, < Cnv-®+0+e < Cny-, 


Proof of Theorems 
1. THEorEM 1. If s > 5 and 0 < B <a, then 
T 
r(n) = 
) = spar 
where c = c(s,B) > 0. 
The coefficient of 2” in 


\(n)n¥+-O(nY-), 


2 {II by i(x)}, 
where p runs through all primitive gth roots of unity, is 
n) > q-*S5p-” = p(n)A,(q, 2). 
p 


Hence, the coefficient of x” in 


> > (II %,,() 


a<ni(1-8) p 
is aoe 
From this we have, by , theorem, 


t; { (TA —_ ETT bpalo)) Se 


a<ni(1-3) p 


r(n)—p(n)S(n, ni1-P)) 
Qri 
t 


= Js-rvse $e f esn a 


“ae 
om Tr a 
-> | {IT ¥,(~) = =) 


wm’ 
K-—M 
Then, by Lemmas 15, 16, and 17, 


Ir(n) —p(n)S(n, n¥0-B)| < S (Lt b+ h) < Cnr, 
at 
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Finally, by Lemmas 4 and 7, 
r(n) = p(n)S(n, n¥0-P) +4 O(ny->) 
= p(n) S(n)+O(nY-44-P) + O(n) 
TnvS(n) . 
— sre Tay?” 
where G@ is a number of order 
O(ny-) + O(ny-*2-P) + O(ny-F) + O(ny-1+8) = O(ny-). 
.2. THEorEM 2. If s > 5 and 0 < B < a, there is a number 
Mp Be Maks, B, Ag 5.005 Migs Miggines Migy Aay--+y Mad 
such that, for all n > no, there is a solution of the equation 
n = mi+m2-+...+m? 
in positive integers satisfying the conditions 
A,;n—Ajnl-B < m? < A;n+A;,n'-? (s = 1, 2.,..., 6). 
This is an immediate consequence of Theorem 1 and Lemmas 3 
and 8. 





‘ 


THEorEM 3. If s >5,0<B <a, and n > 1, there is a solution 


of the equation ‘ . ° 
f d n = mi+m3-+...+m? 
in integers such that 


|A;n—m?| = O(n1-*) (¢ = 1, 3....,8). 
For n greater than some n, = C this follows from Theorem 2. If 
l<a<, 


n can always be expressed as the sum of s squares, some of which 
may be zero. Also, 


|A;n—m?| < 2n < C = O(n'-*). 








THEOREMS ON POLYNOMIALS IN A GALOIS 
FIELD 
By R. E. A. C. PALEY (Cambridge) 
[Received 24 June 1932; revised form 20 November 1932] 
1. Let p be a prime and let &(y) be a polynomial 
Ay tA, Y+a,y?+...+AyYy™; 
where dy, 4, @,..., @,, are integers lying in the range 0, 1, 2,..., p—1, 
does not vanish except possibly in the case m = 0. We say 


and a 


m 


that é(y) is a polynomial reduced modulo p, and of degree* m, and 
we identify é(y) with polynomials of the form 


Aj+a,y+a,y?+... +4, y™+0.y"4t+O0.ymt?+...+0.ymt*, 


Suppose that é(y), n(y) are the two polynomials a)+-a, y+...+@,y™ 
and b)+b,y+...+6,,y™ (we may assume, in virtue of what has 
already been said, that they have the same number of terms). We 


— E(y)Lnly) = Cote yt---+emy™ 
where c, lies in the range 0, 1, 2,..., p—1, and c, = a,b, (mod p), 
(r = 0,1,...,m). We write —&(y) = 0—é(y), and observe that when 
p = 2, then &(y) = —&(y). 
We define &(y)n(y) similarly in the following way. We write 
E(y)n(y) = dotdyy+...t+deny, 
where d, lies in the range considered, and 
d,, = a)b,+-a,b,_,+...+4,b) (mod p). 
If € and » are of degree m and n respectively, it is not difficult to 
see that the degree of &y must be m-+-n. 

The theory of polynomials of the type defined, and of the more 
general class of polynomials with coefficients in a finite Galois field, 
which we discuss later, has been systematically carried out, and 
theorems analogous to those of ordinary algebra may be obtained.+ 

We define as irreducible a polynomial z(y) of degree h > 0, which 
cannot be expressed as a product of two polynomials each of degree 
greater than 0. 

* The degree of the polynomial 0 is not defined. 


+ See, e.g., L. E. Dickson, Linear Groups, with an Exposition of the Galois 
Field Theory, 1901, pp. 1-54. 
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Now let z(y) be an irreducible polynomial of degree h, and let O(y) 
be a rational integral polynomial. We can write 


D(y) = m(y)A(y) +49 +4, y+4, 9+... +a, 4", 
where A(y) is a polynomial of the field, and the remainder is a poly- 
nomial of degree h—1 with coefficients in the required range. This 
class of polynomials (of order p”) form the residue class mod x(y) for 
polynomials of the field considered. We call them the marks of the 
finite Galois field GF(p"). We denote by « a mark of the field. 

Now let 6(x) be a polynomial with coefficients « lying in GF(p"). 
Write O(x) = eg +e, 7+e,27+...+¢€,,2™. 

If «,, does not vanish, the polynomial 6(x) is said to be of degree* m. 

We may now define addition, subtraction, and multiplication for 
our new polynomials 6(x). Let 

OD(ax) = V4 Vat Matt + ellgm, 
O2)(x) = €) +€)(x)+€Pa2+ + €2am; 
we may again assume that 0%, 6@ are of the same degree. Then 
OY (x) +O (x) = egte,xt+enr+...t¢,, 2”, 
where €p, €),---, €», are marks of the field, and 
e, = €?)+e (mod a(y), mod p). 
And OD (x)OP(x) = eg tejrate,at+...te/ a”, 
where €/, €},..., €, are marks of the field, and 
ef = Me) free +... +e) (mod z(y), mod p). 
We observe that 
degree 0(x)0°(x) = degree 0(x) + degree 0°(x). 

We define as irreducible a polynomial w(x) of degree greater than 
0 which cannot be expressed as the product of two polynomials of 
degree greater than 0. 

We use Greek letters generally to denote polynomials, and the 
letter w to denote an irreducible polynomial. We call primary a poly- 
nomial when the coefficient of the highest power of x is the mark 1. 
[t may be shown that every polynomial of degree greater than 0 may 
be expressed uniquely as the product of primary irreducible poly- 
nomials and one mark of the field (which, of course, is 1 when the 


original polynomial is primary). 


* Again, when every coefficient ¢€ vanishes, the degree is not defined. 
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Our first theorem concerns the number of ways in which a poly- 
nomial can be expressed as a sum of two nth powers. We prove 
that, given and m, we can always find polynomials which can be 
expressed in m different ways as the sum of two nth powers. 

We next go on to consider the analogue of Waring’s theorem. The 
proof of Waring’s theorem is not difficult, and furthermore in certain 
exceptional cases we are able to obtain some quite remarkable Waring 
numbers, provided that we admit the algebraic sum of powers of not 
necessarily primary polynomials taken positively or negatively. 

2. THeoreM I. Let n, m be two arbitrary positive integers. Then 
we can choose a polynomial v which can be expressed in at least m 
different ways as the sum the nth powers of two primary polynomials. 

We observe that we may assume without loss of generality that 
n is divisible by p. For if v can be expressed in m different ways 


y= a”-- 


in the form n4 Bn. (2.1) 


then v? = «””-+-8"” for all the solutions of (2.1) and conversely, and 
hence v? can be expressed in m different ways as the sum of two 
(np)th powers. Next we observe that we may assume that n is of 
the form p*—1, for we can always find a polynomial of this form 
divisible by any n prime to p. 
We use the identity 
{E(EP*-1 4 yh)" 14+ fn (EP* 1+ gy 
= (EP Pty = (GP) (gr. 

We observe that if the degrees of £ and y are unequal, and if £, » are 
primary, then the two polynomials on the left-hand side of (2.2) are 


also primary. Now suppose that it has been established that we can 


find a polynomial v, which can be expanded in N different ways as 
the sum of two (p*—1)th powers 

y = af*-14 Br) = og*-14 pet =... = of 14+ PR-1, (2.8) 
such that the numbers a4, B;, a, Bo,..., &y—, By_ are all primary p*th 
powers, while ay = yBy, y being an arbitrary primary polynomial 
which is not a p*th power, and ay, By also being primary. If in (2.2) 
we write = y, 7» = 1, then we observe that, for NV = 2, the equa- 
tion (2.3) is satisfied by putting 

a=", A= = ry +1), B= y+. 

If now we put 


=o", =p (r= 1,2....,N), 
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then 
ye — oP 14 Bie) = ae tt Bet — .., = a 1 Be, 
while 
vid pk} +B% pr-1 = pel (BR) p*-1 
_— {ay xP, pr-11+ Bp 1)}0* AL {BA (av 14 By -1) je" -1 
y* 


— ! fee = rp “ 
—= On 41 N+ 


We observe that ay ,, = - ,1, 80 that the pair a‘, ,,, By, is certainly 
distinct from the pairs a;, 8} (l<r< N), the latter being all p*th 
powers. Also all the numbers a, B; (1 <r < N-+1) are primary. 
Thus, if (2.3) is satisfied for N = M, then, by appropriate choice of 
v, it may be satisfied for VN = M+1. Finally, since it is satisfied with 
N = 2, the desired result follows. 

If in (2.2) we make » primary and keep it fixed, and allow € to 
run through all primary polynomials of degree different to that of 
», then we obtain the theorem 

THEOREM II. We can obtain an infinite number of primary poly- 
nomials which can be expressed in an infinite number of ways as the 
sum of the nth powers of three primary polynomials. 


3. Before discussing Waring’s problem we need some preliminary 
lemmas. 
LEMMA l. 
l 
1 
l 


] 
The lemma is immediate. 


LemMA 2. Let n = p™+p™-+...+p", where none of the numbers 


Ny, Nos---5 N, occurs more than p—1 times. Let h(p—1)k’ > k. Then 


v” = 0. (3.1) 
degree v=k’ 
v primary 


We observe first that in order to establish (3.1) it is sufficient to 
consider the same function summed over all polynomials of degree 
less than or equal to k’ and with constant term unity. For we 
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obtain* this set of polynomials v if we change x into 1/x and multiply 
by x, so that the general term of the sum (3.1), which is 
(2+, 2*-1+...+€,)", 
becomes (e, 2 +e,_, 2" 1+...+6,27+1)", 
where all or any of the marks ¢,, €,..., €,, may vanish. 

We prove the lemma by induction. It may easily be verified for 
k’ = 1. Now suppose that the lemma has already been established 
for k’ = m, and consider a number n» for which the least value of 
k’ism+1. Then 


= (vex), 


degree vom-+1 degree vom € 


constant term=1 constant term=1 
where the internal summation is over all the marks of the field. The 
last sum may be written 
‘ No r r(m+1),,.n—r| 
degree vom (> r p3 ul ‘ }? 
constant term=1 

where “e, denotes the residue (mod p) of “C,. Now the coefficient 
of x”"+) in the above sum is 

U n-r r 
* ‘nat m 2 fs 

constant term =1 

and vanishes unless r > 0 and p*—1 is a factor of r. Now 

n= p+ p™t...tp™; 
let us suppose that 

r= pit+pe+...tpln, 

2—?T = p™--p™--...-- pm, 

and that neither of the sets (/,,/5,..., 1;,), (2, mg,..-, m,,) contains the 
same number more than p—1 times, while k,+-k, > k. Then some 
integer must be contained at least p times in the set (J, 1s,...,;,, 
M1, Mbp,...,m;,.). Let us suppose, as we may, that 


L, — l, =... = = Mm = M =... = My, 
and let us write 
= prt plritpliet...tpls 
n—l = pment pm itt... ppm, 
Now the number of times that p is contained as a factor in the 
product 7! is 


ky 
> (p+ p+... +1), 
s=1 


* The underlying idea, which is due to Mr. Davenport, will be developed 
elsewhere. 
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with similar expressions for the number of times that p occurs as 
a factor of (n—r)!, r’!, and (n—r’)!. Thus the number of times that 
p occurs as a factor in r’!(n—r’)! exceeds the number of times that it 


occurs as a factor of r!(n—r)! by 


prtprat...+14 
k, 
t 


k, 
(pet phe? +... +-1)+ > (pm pm2+ ...+1)— 


1 s=p—t+1 


k, k, 
— > (prt ptt... $1)— > (pm tt pm 2+... $1) 
s=1 8=1 


= phtpht+...+1—p(pr+ pr?+...+-1) 

=a. 
The number of times that p occurs as a factor of n! is not less than 
the number of times it occurs as a factor of r’!(n—r’)!, and thus 
exceeds the number of times it occurs as a factor of r!(n—r)!. Thus 
we have shown that “e, vanishes unless k,+-k, = k. 

Now since the third factor of (3.2) vanishes unless p”—1 is a factor 
of r, and this implies that k, must be divisible by h(p—1), it follows 
that (3.2) vanishes unless k, < k—h(p—1). But this means that the 
corresponding value of k’ can be at most m, and, since the lemma is 
supposed to have been established for k’ = m, it follows that, when 
k, < k—h(p—1), the second factor of (3.2) must vanish. Thus one 
of the three factors of (3.2) must always vanish, and this proves 
the lemma, 

LEMMA 3. > yo 2 Q, (3.4) 

degree v=’ 
v primary 

Suppose that the sum (3.4) vanished. Then we should have, by 

Lemma 2, m— tcunc wei 
oie Pg : (0<m<p ), 

v primary 

and thus the determinant of Lemma 1 would vanish with n = p’*, 

and « running through the primary polynomials of degree v. This 

would give a contradiction. 

4. Suppose that w is an irreducible polynomial of degree n. Then 
it is known* that, for any a, «?”"-! must be congruent to 0 or 1 
(moda). It follows that the sum of any number of (p’"—1)th 
powers can only be congruent to 0, 1, 2,..., »—1 (moda). If, there- 


* Dickson, loc. cit., Chap. I. 
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fore, we ask what is the smallest number of (p’”" —1)th powers as the 
sum of which all polynomials can be expressed, the question has 
clearly no finite answer. We can, however, prove the following 
theorems. 


THEOREM III. Let n be any integer prime to p. Then all polynomials 
which are not barred by congruences from being the sum of nth powers 
can be expressed as the sum of the nth powers of at most K polynomials, 
where K = K(n) is a number which depends only on n. 

It will be sufficient to prove that all polynomials of the form 
I'v-+-A (v arbitrary) can be expressed as the sum of at most k nth 
powers, where I’, A are polynomials which depend only on n, and 
k. also depends only on n. For this will show first that the only con- 
gruences relevant to sums of nth powers can be those (mod IT). 
Suppose that the possible residues (mod I’) are ay, a,..., ay. Then 
it is possible to find polynomials v,, (1 < m < M) such that each of 
the polynomials T'v,,—A+«,, (1 < m < M) can be expressed as the 
sum of nth powers. Let k’ = k’(n) be the maximum number of nth 
powers required for the expression of the finite set Tv,,--A+a,, 
(l<m< UM). Since ['(v—v,,)+A can be expressed as the sum of 
at most k nth powers, it follows that Tv+«,, (1 << m < M) can be 
expressed as the sum of at most k+k’. Since the last expression 
includes all polynomials which can be expressed as the sum of nth 
powers at all, the desired theorem follows. 

To establish the assertion made above, we consider sums of the 


form Y  (v™y+am™)", 4.1) 


degree a=1 
a primary 


where /, m,, and m, (mz > m,) are positive integers which remain to 
be chosen (m, may vanish). We first choose m,, J so that 

(n—1)m, < p” < nmzg, 
and write pu—1 = (n—1)m,+m,. 


Now in the sum (4.1) the coefficient of v” (r > 1) is of the form 


ne, a” (r’ < pu— ] :. 
degree « l 
a primary 
and so vanishes by Lemma 2. On the other hand, the coefficient of v is 
Al_ 
yP 1 


degree a=1 
a« primary 


which by Lemma 3 is different from zero. Thus the sum (4.1) is of 
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the form I'y+-A, and Theorem ITI then follows in virtue of what has 
already been said. 

THEOREM IV. Let n be prime to p. Then all polynomials which are 
not barred by congruences from being the sum of nth powers can be 
expressed as the sum of at most K’ primary polynomials, where 
K’ = K’'(n) ts a number which depends only on n. 

Let I’ denote the number of Theorem III. Let y be expressible as 
the sum of nth powers. In virtue of Theorem III we may suppose that 
y = BI+Bi+...+Bp- 

Then we can choose a polynomial é such that £1 is primary and of 
degree sufficiently large to ensure that [+8,, £[+-,,..., 6+, are 
also all primary. Since 

y’ = (€0+8,)"+ (EV +B2)"+... + (EP +B)" (4.2) 
is congruent to y (modI"), we may write 

y—y = Vy tret-- trp); 

where the numbers 1, v9,..., v, are all primary, and of degree suffi- 
ciently large to ensure that all the terms of the sum (4.1) with 
V = V4,Vg,...,¥, are primary. Then 


p 
| i y+ 2 (I'y,+-A), 
=1 


and, substituting for y’ and (['v,+A) by means of (4.2) and (4.1) 
respectively, we obtain the desired result. 


5. We will now investigate further the case in which n = p’™-+-1. 
We observe first that 
hm hm hm hm 
(var ye" (ay 1)" +1 — yr (ay) 


= (ae — a2) + app _] a Tv+A, 


hm . 1 


where, in virtue of a known theorem,* 

= Il wo. (5.1) 

a =. |. 

Thus, if we allow plus and minus signs arbitrarily, we have estab- 
lished the assertion made at the beginning of Theorem ITI, with I of 
the form (5.1) and k = 4. We will show that, with the notation 
of that theorem, if we still allow plus or minus signs, then k’ = 1 
or 2. We first need some preliminary lemmas. 


* Dickson, loc. cit., pp. 15-16. 


















60 R. E. A. C. PALEY 

Lemma 4. Let w be one of the factors of (5.1) and let degreew = r. 
Then the highest common factor of p"”—1 and p"™-+-1 is 2 when p > 2, 
r\m, and of the form (p""—1)/(p" —1) otherwise. 

First suppose that r is a divisor of n. Then the highest common 
factor is clearly 2 when p > 2, and 1 when p = 2. Ifr is not a factor 
of n, then r = 2r’ where r’ divides m and m/r’ is odd. Then the 
highest common factor is p”’”’ +1 = (p’”—1)/(p"”—1). 


Lemma 5. Let l < p’"—1, Then 
of = 0. 
degree a<r—1 
« not necessarily primary 
The proof is similar to that of Lemma 2. 
Lemma 6. Let w be an irreducible polynomial of degree r. Then we 
can find a polynomial « of lower degree than w (x 4 0), such that 
v £1 (moda) (l = 1, 2,..., p’”—2). 
Suppose the lemma were not true. Then, as with ordinary algebra, 
we could show that there exists a number / < p’”"—1, such that 
x’ = 1 (moda) (all a, (w,«) = 1). 
Thus addition gives 
a! = —1 (moda), 
degree a<r—1 
x not necessarily primary 
which contradicts Lemma 5. 
Lemma 7. Let w be an irreducible polynomial of degree r. Let 
‘Ul’ = p’—1. Then the necessary and sufficient condition that the con- 


gruence 
g p! = a (moda) 


is soluble, is a’+1 = gy (mod a). 

Let « be the residue of the last lemma. Then the set {a} (J = 1, 2...., 
p’"—1) runs through all the residues (except 0) (modw). Lemma 7 
now follows by the familiar arguments of ordinary algebra. 


Lemma 8. Let mw be one of the factors of (5.1), and let degree w = 1, 
where, unless p = 2, r is not a divisor of m. Then the necessary and 
sufficient condition that the congruence 

pe'"+1 = B (moda) 
is soluble for some poiynomial py, is 
pe” = B (moda), (5.2) 


where r’ denotes the number of Lemma 4. 
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In the case where p = 2 or when r does not divide m, (5.3) follows 
at once from Lemma 7. 

Since (B,+8.)”"” = pr’ +pP"”, 

(5.2) is also the condition that the polynomial f is congruent (mod w) 
to a sum of nth powers. 

Now let p = 2. We prove the following theorem. 

THEOREM V. Let p = 2. Then, with the notation of Theorem III, 
K(Qhm1+] ) <= 3. 

Let y be an arbitrary residue (modI’), which can be expressed 
(modI’) as the sum of nth powers. Let w,, w»,..., wy denote the 
factors of (5.1), and let y = B, (modm,) (l = 1, 2,...,N). We can now 
find polynomials p, such that np? = B,(moda,). Let p = yp, (modm,) 
(J = 1,2,...,N). Then p” = y (modT), and thus k’ = 1. This proves 
the theorem. 

When p > 2, the result is not so elegant, since, when r is a divisor 
of m, the condition (5.2) now becomes equivalent to the condition 
that « should be a quadratic residue (moda).* We may, however, use 
the methods of Theorem V to prove the following theorem. 

THEOREM VI. Let p> 2. Let y be a polynomial which can be 
expressed as the sum of nth powers, where n is prime to p, and 
n= ph™+1. Let n’ be the greatest odd factor of n. Then y can be 
expressed as the algebraic sum of at most six nth powers taken positively 
or negatively, or as the sum of at most five n'th powers. 

Let m1, @»,..-, 7y denote the factors of (5.1), and let y = B, (moda) 
(J = 1,2,...,N). If degree w, is not a divisor of m, we can find a 
polynomial p, such that yj = 8,;(modm,). If degree w, is a divisor 
of m, then we can write 8, = B/”—£{?) (moda,), where Bf”, Bi?) are 
both quadratic residues (modw,). We have only to observe that, 
since p > 2, we can write 2y{ = B+1(modam,) and 2n/? = B,—1 
(moda,), and a ({?)?- pe 2 = B,(modw,), and we p He only 
to put BY) = (nf”)?, Bi? = (nf)? Now the equations (y{?)" = Bf 
(mod a;,) and i n = Bf?) ( walla ) certainly have acletitiek ; finally 
we write 


p® = pw, (moda) degree w, does not divide m), 


( 
p® = pf) (modaw,) (degree w, divides m), 
(degree w, does not divide m), 


yp = 0 (modam,) 
uy?) = pi? (mod a,) (degree w, divides m), 


* If p = 2, h = 1, then all polynomials are primary, and so K’ (2"+-1) < 5. 
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then clearly (u™)"—(u)" = B,(modam,) (J = 1, 2,...,N), and thus 
(u)"—(u)”" = y (mod). It follows that k’ < 2, and this proves 
the first part of Theorem VI. 

For the second half of the theorem we observe that, when degree 
aw, is not a divisor of m, then we can find a polynomial p, such that: 
ui = B,(modw,), and that, when degree w, divides m, then the 
highest common factor of n’ and p’”—1 is 1, so that pt’ = y (moda) 
has a solution for all choices of 7. Thus we can find a polynomial 
pw such that np” = B,(modam,) (l = 1, 2,...,N), or, what is the same 
thing, »”” = y (modT’). Thus we can express y as the algebraic sum 
of 5 n’th powers in the form 

Ley LEN ber ey ey 
~ (46) +(4€2)” 4 (LE) + (LE ELE", 
since n’ is odd. ~ i y can ty expressed as the sum of at most 5 
n'th powers. 

6. We confine ourselves now to the case where p = 2,h = 1. We 
prove the following theorem: 

THEOREM VII. Letn >1. Then K(n) > 3 

We may assume that n is odd. Let a be an irreducible polynomial 
of degree r. Let a”+ 8" =m. We distinguish two cases. 

(a) a+B = 1.. 

In this case degree « = degree 8 = | (say). Then it is not difficult 
to see that 

a" + BY = g"—-1+ q"-2B+ |. + Bn-t 
= a4 (a+f)(a"+o"9B+...48"-8)B. (6.1) 
The degree of the second member on the right-hand side does 
not exceed (n—2)l, while that of the first is (n—1)l. Thus 
degree (a”+-£”") = (n—1)l, 
* we get a contradiction if r is not Pi isible by (n—1). 
) a+f = ww. 

ad must either have degree « = r, degree 8 < r, in which case 
degree («”+ 8") = mr >, or else degree a = degree B > r [since 
there are only two marks of the original Galois field, if «, 8 are both 
of degree r, it must follow that degree (a+) <r]. In the last case 
the argument used above shows that 

degree (a”-+ 8") = degree (a+-8)+(n—1) degree a > r. 


Thus again we have a contradiction. 
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In the special case n = 3, we can sometimes improve on Theorem 
[V. The argument of that theorem gives 
Pr = at+a = a(x+1)(x?+2+1), 
and the possible congruences of a cube with respect to these factors 


as a® = 0 or 1 (modz), a? = Oor 1 (modx+1), 
a? = 0 or 1 (modz?+z+1). 

We prove the following theorem: 

THEOREM VIil. Let y = 0 or 1 (mod2z?+-2+-1), y ¥ 0 (modz?+2). 
Then y can be expressed as the sum of four cubes.* 

The possible congruences of y (moda*+<z) are 1, 2?+2a+1, 2%, 
2+1, 2+a?+a, 23+a?+2+1. We treat the six cases separately. 
We have only to observe that 
(va +-a?)8 + (va 1)8+-(v(~+ 1)+2?)8+(v(x+ 1))® = (at+a+1 
(var +-a2+1)8-+ (va)>+ (v(x 1) +22)®+(v(e+ 1)? 

= (a*+2)(v+1)+2?+2+41 
(v(~+1)+-a+1)8+(v(a+ 1)-+2)8+ (v+a?+ 1)8+r4 

= (a4+2)(v+2?+ 1)+24 
(v(x+1)+1)®+(v(x+1))8+ (+224 1844 

= (2*+2)(v+2?+ 1)+2°+2°+2 
(va+ax)e+(vat+ta+1)8+(v+a?)8+r = (a4+2)(v+27)+a3+2?+2+1 
(var-+1)8+- (va) + (v-+-a2)8-4+08 = (etfa)(vta%)+a3+ 1. 


* In this field all polynomials are primary. 





on THE FUNCTION 1/(1+ it) 


By E. C. TITCHMARSH (Ozford) 
[Received 7 August 1932] 


1. One of the unsolved problems in the theory of the Riemann 
zeta-function is that of determining how large are the greatest values 
which |1/¢(1+-7t)| takes as ¢ > oo. Writing 


1/Z(1-++it)| 


=< im 1 

een gd loglog t (1) 

it was proved by Bohr and Landau* that p is positive. It is not 

known that , is finite; but, if the Riemann hypothesis is true, then 

» is finite, and in fact i (2) 
6 

where '—= —2, (3) 
= 

y being Euler’s constant. In view of this it is of some interest to 

try to determine a definite lower limit for y. 

The argument of Bohr and Landau of course leads to such a lower 
limit, but not to one of much interest. On the other hand, it was 
proved by Littlewood that, on the Riemann hypothesis, 

p> RO. (4) 

Here I prove without any hypothesis that 

p> b. (5) 
Comparing (2) and (5), we see that (on the Riemann hypothesis) it 
is only a factor 2 which remains doubtful. The state of affairs in 
this problem is now exactly the same as in the corresponding problem 
for ¢(1-+-it), where similar results hold with c = e” instead of b. 

The method is suggested by my proof{ of the corresponding 
theorem for ¢(1-++-it). That, however, depends on the fact that the 
coefficients in the Dirichlet series for {(s) are all positive. As this is 
not true for 1/Z(s), the details of the proof naturally follow a different 
course. 


* Bohr and Landau (1) and (2). See the list of references at the end. 
+ Littlewood (4). { Titchmarsh (5). 
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2. We first require some formulae. 
Let k be a positive integer. Then, for o > 1, 


I 1\* k  k(k—1) 1 _ (—1)F 
== j| — — —— ]— — +... af 
yy | 7) IT ( p12 pe pe 


man Bae <> b,.(n) 
Hen eye = 2 a (1) 


where the coefficients b,() are determined in an obvious way from 
the preceding product. 


The form of these coefficients shows that 


S [Ox(m)| I(! eee 1, ‘) 


n=1 


ia Tes 
ns f' 1.23 = 


113) -T1e-By-Ay a 


We have also to study the function 


G(s) = } ee. (3) 


n=1 
It is clear from the way in which the coefficients b,(n) are formed 


that, for o > 1, 


~ ‘42 92 ps edi ps 


ek—-1)? 1 .) 


ay, where 


“| 2 
*__ ghmeimd dd 


fz is real, 
—m)! 


i (1+ 22! cosd+z)* dd 


ale +7008) « Id. 
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It follows from (7) and Laplace’s integral for the Legendre poly- 
nomials that* 


g.(2) = (I— ap, (* + ‘), (8) 


__* 


where P,(z) is the Legendre polynomial of degree k. This formula 
of course holds for complex z also, and hence 


>» {l+p-> 
)= TT [ee 
. 3 , (itp . 
~ {£(8)}* ra )} (9) 


This formula is of some interest in itself, and leads to the identity 


= Fi.1(8)/{S(s) "4, (10) 


where f 


Here, however, we can get what we want more conveniently by 
using (6) directly. If z is real and positive, 
7 
Ix(2) S [a +24) dp = (1-+-2#)*. 
0 
Hence, for o > 2, 


aon T1Qpl= TFSI 


On the other hand, 
ak 


9;.(z) > ~ | |1-+-ztetd |2k dd 


alk 
Facet me Iams <9 ‘dd. 


ah eh a 


Here 
1—cos¢ < 2sin?(7/2k) = O(k-*), 2#/(1+ 2z#+z) = O(1). 


* The formula (8) is, essentially, Murphy’s well-known formula 
P(cos0) = cos*}0 F(—k, —k; 1; —tan*$@) 
with z —tan*30; cf. Hobson, Spherical and Ellipsoidal Harmonics (1931), 
pp. 22, 31. 
t See Titchmarsh (5), pp. 74-5. 
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alk 
Hence 9,(z) = cer | (1+-0(f)} a6 
ee 
+z) 2k 
<a a3) 


for k > ko, independently of z. Hence, for o > 1, 


G.(c) > I] (5z(1+ rs x) } (14) 


pr 


where x is any positive number. 


3. We can now prove the main theorem. Let o > 1, and let N be 
any positive number. Then 


T T 
F far. 1 | b,.(m) b,.(n) 
7 | | z a hee : a Done dt 
0 adie = N nei 
bj.(m)b,(n) ( (n\* 
Se. ————- | |—) 
n TH +7> 2 mn? ‘() 
{b,.(n)}* b,.(m)b,(n)| 2 
Ee ak" ‘k RMT | 
5! ne) «oT 7 > “a mn? \log(n/m)| ° 


1 
> lost lo >, 











Here 





n 
log — 
m 





so that the last term is less than 
4N [b,(m)b,(m)| _ 4N' /S |b, (n)|\2__ 4 f &(o) \* 
Ton “= 7 (2 “a ) = Flee” 
Since {(c) ~ 1/(a—1) as o > 1, we have, if o is sufficiently near to 1, 


C(c) 3 9 1 


{(20) ~ {(@(e—1) — #(o—1) ~ o—1” 


and the above last term is less than 


4N/T(o—1)*. 
| b,.(n) \b,.(n)| 1 \b,.(n)| 
wor 2 w SQ ie 


nN 
1 (é(do+s)*_ 1 [2 \* 
be Wie +1) S yre- Ae Al 


Also 
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for o sufficiently near to 1; and ange? 


: {b,.(n)}* {b,.(n) {b,.(n)}2 
7,.(20)— 7 = s ce. 
G,.( a) a nz dy n n)} <V ol as not 


1 (f(0+3 ee 5 ( 2 i 








s—1 


G.,. 1 = Buc 
1 oP ) 1| f(o+1) j No-1\oG 





] 
<— We 


These two differences are therefore both bounded if 


9 2k/(o—1) 
N = [- : 
o— 1 


With this value of N we have 


2 4N/ 1 \% 
+. O(1)| dt. > G,(20)—— (—__} + O(1 


> TT fex(*+ 3) | (a) +m, 


by (14) of § 2. The number of terms in this product is 7(~) << Az/log x. 
Hence, if x > vk, 


—rl ] Axzjlog x ; A: uc =e 2k ae 
per 
Also 


i I (+=) - > os) on > 0(loex j *) 


Px PLx 


= Of es \" = Of{(o—1)log 2}, 


and* as x > oo |] ( a = {b+-0(1)}log x. 


peux 





Hence 


. l Ita +53) ' > e-Ae-Ak(o-log fh 4 9 (1)}2k loge, 


prx p* 
Let «= dk, where k?<68< 1, and o=1-+y7/logk, where 
0<7< 1. Then 


2k 9 »\ (2k log k)/n+2k 
Nv La < (? log k / 
o—l n 


* Littlewood (4), p. 350. 
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and we obtain 


ss e~Adk—Ank{h + 9 (] )ye* log?*8k — 


(2k log k)/n+2k 
7) ig k)/n +0(1). 


7 


2 -\ (2klog k)/n+2k 
Finally, let ee (- =e") cikieal 
7 


Then log T = (~2e" + 21) log Z— 
7 


9 > 
loglog T’ = log k + + log (=28* = a ) + loglog - = 
7 


< (1+e)logk 
fork > ky = k,(e,y). Hence 


1)| dt 


il ama _(loglog T' 
> e-Adk—A mfb-+0(1))(— re — 


4. Let M,7 = max |1/C{(o+-7#)|. 
o<t<T 
Since the first term on the right of the above inequality tends to 
infinity with & (for fixed 8, y, and e), it is clear that M*,, tends 
to infinity; hence 


rae + 0 |< 2M », 


if k is large enough, py we deduce that 


14 logs 


4M*,, = J le- Adbk- Ank{h + +0 (1yp4{ EOE sy 


for k large enough. Hence 


1/2k 
Mor > (5) e-At-anto-+0(1)}( EPS log). 


e “—¢€ 


Giving 5, e, and » arbitrarily small values, and then varying 7’, it 
is clear that 
Mr 


lim 
-" loglog T ~~ 
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where, of course, o is a function of 7. But if » < 5, it follows from 
the usual Phragmén-Lindeléf argument* that 

|1/Z(a+it)| < (u+e)loglogt < (b—e)loglogt 
throughout the strip 1 < o < 2. This contradicts the result obtained, 
and the theorem therefore follows. 


* See Littlewood (3), p. 193. 
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SPATIAL DISTANCE IN GENERAL RELATIVITY 
By A. G. WALKER (Edinburgh) 
[Received 20 August 1932] 


A PROBLEM introduced recently is that of finding a suitable definition 
of spatial distance, i.e. the distance of a star from an observer, in 
a general Riemannian space-time. Professor E. T. Whittaker* gave a 
definition based on the method of comparing absolute with apparent 
brightness of the star, and this has been modified to allow for the 
motion of the star. In this paper is introduced a similar definition, 
made possible by a new theorem on null geodesics. It can easily be 
verified that this definition is equivalent to that given by Etherington. 

Relative coordinates are frequently used in this paper, these having 
already been introduced by the author.{ 


1. Relative coordinates 

If C is a curve in a Riemannian space V,, and if an orthogonal 
ennuple is given at points of C, then at each point P of C is defined 
a system of normal coordinates with P as origin, the parametric 
directions at P in these coordinates being the directions of the vectors 
of the given ennuple at this point. Thus there is a system of reference 
at each point of the curve. These coordinates are called relative 
coordinates, and are written (z°,z},...,2"-1). If the points of C are 
defined in terms of a parameter s, taken to be the arc if C is not null, 
any other curve C’ may be given by the z’s as functions of s, a (1, 1) 
correspondence being set up between the points of C and C’. In 
general, we only consider curves in the neighbourhood of C, so that 
the third, and sometimes the second, powers of z may be neglected. 
It is usually convenient to take a particular correspondence, the 
most useful being such that z° = 0, i.e. if A,* (o, 7 = 0, 1,..., n—1) 
are the components of the vectors of the given ennuple, the point 
Q of C’ corresponding to P of C is the point where C’ meets the 
geodesic surface orthogonal to the direction A,‘ at P. 

If x’* are the given coordinates of V,, the curve C is defined by 
equations of the form «’' = x'(s), and if g;; is the fundamental tensor, 

* Proc. Roy. Soc., A, 133 (1931), 93. 


+ Etherington (not yet published). 
t Proc. Roy. Soc. Edin., 52 (1932), 345. 


4 








72 A. G. WALKER 
the tangent vector of C is A‘, where 
At = dzx*/ds, 9;jX'N = e, (1.1) 

and e = 0,-+1 according as C is, or is not, null. Invariants w’, y,, 
are defined by the equations 

v= Ur, To — “Ta Ag igry ‘nN, (1.2) 
an index appearing as a subscript and also as a superscript indicating 
summation, unless attached to e = +1, the summation being from 
0 to n—1 unless otherwise stated. With this notation, the are 8 of 


a curve C’ near C is given by 


ds\* st : 
| ¢ = [6 > | » 2 9», aOsVv 
+(3 ik 2Y on ulze+-2 2 €y Urn? —- p 3 (27) T Yor 272" +- 
ds a o 
cs 2 Co Vou Yov 22” +1, 2%", (1.3) 
o 
. ‘ 30 —— Aa : =e 
where 2° = dz*/ds, Cy = GizAoi'Ao? = £1, 
:, = hyky in f 
and yy = No = Brigg A"A*AQ('A,/- (1.4) 
Y oa ¢ ’ Ie > > 
a - > ] af > 
If C is a geodesic, then, from (1.2) 


Yoh ul = A, i,j Ar ce ey u’, 


and hence Von wiz?+ > e,u%? = J, 
7 o 


where J =} ¢, 0s". (1.5) 


If C is a null geodesic,* and C’ a null curve, e = 0, and ds/ds = 0, 
and from (1.3) we have 
J = const. (1.6) 


to the first order of approximation. 


2. A theorem on null geodesics 

If P), P are points on a null geodesic C, let p denote the thint 
pencil of 0o”-* null geodesics through P, passing near P. We shall 
prove that the volume of the (n—2)-dimensional cross-section of p by 
a surface orthogonal to a vector V' at P is independent of the direction 
of V+. 

* It is assumed that when C is a null geodesic, the parameter s is so chosen 
that the geodesic equations of C take the usual simplified form. The parameter 
is then defined except for arbitrary additive and multiplicative constants. 

+ There should be no ambiguity in the use of the word ‘thin’. Although 
the inner product of the tangent vectors at FR, of two members of the pencil 
is zero to the first order, the angle between their projections on a space not 
orthogonal to C at P, is small, i.e. the solid angle of the pencil as measured 
by an observer at FP, is small. 
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The pencil p being thin, the cross-section is evidently independent 
of the particular surface orthogonal to V‘. It is assumed that the 
direction of V‘ is such that the square of distances in the cross-section 
may be neglected. Referring to relative coordinates along C, we shall 
take 2° = 0, as explained in § 1, and we shall show that the volume 
when V‘ = A,‘ is equal to the volume for any other cross-section. 
The null geodesics belonging to the pencil p are given by equations 
of the form 
ff = #0, a’, &,...@° 9 (r = 1, 2,...,2—1), 


where z” = 0 at P,, and the a’s are constants, varying for the dif- 
ferent geodesics. A surface orthogonal to A,‘ is z° = 0 in relative 
coordinates, and, writing Dz” = (@2"/éx”)dx”, the linear element of the 
cross-section at P by this surface is given by 
n—1 
+do* = > e,(Dz’)?. 
r=1 
If the surface z° = 0 at P meets a null geodesic CO’ of p at the 
point Q, it has been shown that the tangent vector of C’ at Q is 


u°, where a . 
o . ee 


a 7 — Cy Voy?’ 


b 


Hence, a point Q’ of C’, near Q, has coordinates 2” = z°+-¢u", where 
¢ is an arbitrary function of s, of the order of smallness of the z’s. 
Choosing ¢ so that Q’ is the point of intersection of C’ with a surface 
orthogonal to a vector V‘ at P, and neglecting second-order terms, 
the points of the cross-section of p by this surface are 27 = z7+-¢u”, 
where ¢ is now a function of s and the a’s. The element of length 
of this cross-section is given by 
4dé ="5'e,(D)? = ¥ e,(Dz?-+-u"D¢)?. 
o=0 co 

We have > e,(u’)? = 0, and }e,u7Dz? = 0, the latter following 
from (1.6), in which the constant is zero as the z’s all vanish at P,. 
Hence de® = + > e,(Dz°)? = do’. 
Thus the linear element of the cross-section is independent of the 
particular cross-section through P, from which the theorem follows 
immediately. 

It has been pointed out to me that this theorem is a direct 
generalization of a simple property of surfaces on a null cone in a 
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flat space. In space of the form 
ds? = dt?—dx?—dy*?—dz* 
the null cone through the origin is given by 
t= #i, y =rm, Z2= rn, (=F; 1?+-m?-+-n? = 1, 
and a surface on this cone is given by an expression for r as a func- 
tion of 1,m,n. Its line element is 
do? = —dr?+ > (rdl + ldr)? = r* > di, 
since > /? = 1, }/dl = 0. Thus the elements of distance and area 
at any point are the same for all surfaces through this point. 


3. The equations of geodesics 
If C is a geodesic, not null, and the vectors A,,° are given by 

parallel displacement along C, then y,, = 0, and, from (1.3), 
ds 
ds 
Varying the z’s as functions of s, the geodesics near C are given by 
5 f T'ds = 0, ie. by 

d (eT\ eT 

ine) 


Substituting from (3.1), these equations become 


si = lteJ—4J2+ te b3 €,(2")?+ sel", 22”. (3.1) 


= (o = 0, L. 


ta) — ae 
ao ov’ — ey? 7 
z2°—e, 1,2” = eu. 

If, as before, we take z° = 0, then 

ewdS = —egI4, 2”, 


and hence, if Ap‘ is not orthogonal to C, the geodesics are given by 


2"— (« Tn — 25 0T} = 0 (r,s = 1, 2,...,w—1) (3.3) 
U 


to the first order in the z’s. 

We observe that these equations are unaltered when e = 0, although 
the method of obtaining them is not valid in this case. We should 
therefore expect (3.3) to be the equations of geodesics when C is null. 
This is true, and can be verified by an actual transformation of 
coordinates in V,. In what follows, we take C to be a null geodesic, 
the vectors A,’ being given by parallel transport along C. From a 
property of null vectors we may take the vectors A,/',..., A," to be 
independent of A‘, and orthogonal to C. In this case u?,..., u”-1 


vanish, and (1.6) becomes 
z1 = const. (3.4) 
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From the identities T,u? = 0, > e,(w’)? = 0, equation (3.3) for 

r = 1 is now satisfied identically, and the remaining equations give 

sp—e, 1,2" = 0 we (3.5) 

where r takes the values 1 to n—1. Hence, the null geodesics near 
C are given by equations (3.4), (3.5), together with 2° = 0. 


4. Spatial distance in general relativity 

Let A be a star, B an observer, and, in order that light may pass 
from A to B, let A and B lie on a null geodesic. We shall adopt 
the following definition: 

The spatial distance of A from B is proportional to the square 
root of the two-dimensional cross-section of a thin pencil of null 
geodesics issuing from A and passing near 6, made by the three- 
dimensional instantaneous space of the observer. The definition is 
made complete by the requirement that when the observer is near 
and has the motion of the star, the spatial distance must reduce to 
the element of length in the observer’s instantaneous space. 

The instantaneous space of an observer is orthogonal to his direc- 
tion of motion, and hence, from the theorem of § 2, the above cross- 
section is independent of the direction of motion of the observer. 
Thus we see that spatial distance is independent of the observer's 
motion. It follows, from the definition, that the formula for spatial 
distance will consist of two factors, the first being an invariant 0 
depending only on the positions of the star and observer, the second 
being independent of the position of the observer. The main problem 
is to find the above invariant. 

If A‘ is the tangent vector of the null geodesic C, let A,\*, Ag," be 
two orthogonal vectors independent of A‘, both being orthogonal to 
C and given by parallel transport along C. Then, from (3.4) and 
(3.5), the null geodesics through A passing near B are given by 


ol F 
1 =O and P4T yet =0  (p,g = 23), (4.1) 


where [,, are defined by (1.4), and are functions of s.* We have 
assumed that the vectors are chosen so that e, = e; = —1, these 
vectors being supposed to lie in the instantaneous space of some 
observer at B. The problem therefore reduces to the solution of the 


simultaneous linear differential equations (4.1). 


* It is assumed that the parameter s is chosen so that it increases when 
passing from A to B along the null geodesic. 
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If the position of A is given by s = 8», it is required that the 
solutions of (4.1) should vanish when s = 8). They can therefore be 


written in the form 
2 = a +Bdo, 2 = aap, + Bio, (4.2) 


where ¢,, d., ¥,, %. are functions of s and 8s) vanishing at 8), and a, B 
are small arbitrary constants. The cross-section of the pencil of null 
geodesics by the plane z® = 0 at B is given by equations (4.2) and 
zi — 0, the variables being «, 8, and s having its value at B. The area 
of cross-section is at once found to be proportional to (¢, 4.—¢,,), 
and the required invariant © must be proportional to the square root 
of this function. We therefore define 


+0? = K(, b.—2 4), (4.3) 
where the arbitrary constant K is chosen so that © ~ s—s, as 8 > 8». 
Thus 

Ld = lim $i t2—be va 


ss, (8—8p)° 


From the definition, the spatial distance between the points 8) and 
s is given by an equation of the form 


A = 0, (4.4) 


where v is independent of s. For an observer B’ at the point 8)+e, 
where ¢ is small, we have © = e, from (4.3), and A’ = ve. If the star 
and the observer B’ both have the motion v*, the distance 6 between 
the star and B’ in the instantaneous space of this observer is 


ome Ii; vn’ |... 8," 
Hence, to satisfy the second part of the definition, we must have 
a= 9ij vi! 8=8,° (4.5) 


Thus the spatial distance between the star and the observer is 
A = v@, where v is the inner product of the direction of motion of 
the star and the tangent vector of the null geodesic, and © is the 
invariant defined as above. We observe that © and vy are not com- 
pletely determined by the positions of the star and observer and the 
motion of the star, owing to the arbitrary multiplicative constant 
attached to the parameter s. This constant, however, is cancelled 


in the product A = v0. 
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5. Spatial distance in space of constant curvature 
In a space of constant curvature K,, the components of the 
curvature tensor satisfy 
Ryije = KoGnj Gixr—Ink Iis)- 
We have g,;A‘’ = 0, and g,;A‘A,/ = 0 for p = 2,3, and hence, from 


(1.4), ‘ 
Tg =9 (p,.9 = 2,3). 
From (4.1), the null geodesics through s, are given by 
z* = a(8—8p), z® = B(s—8,), 


and from (4.3) we have onium 


Hence, in a space of constant curvature, if the parameter s of the null 
geodesic is chosen to vanish at the position of the star, the spatial 
distance is proportional to s.* 


6. A more general method of obtaining spatial distance 

The method of § 4 depends on finding two solutions of the equa- 
tions of parallel transport along the given null geodesic. In some 
cases this may be impracticable ; it can be avoided as follows: 

Let A,)*, As be two orthogonal vectors at points of the geodesic, 
both being orthogonal to the curve and satisfying e, = es = —1. As 
before, we have u2 = u? = 0, and z!= 0 for the null geodesics 
through A. Completing an orthogonal 4-uple at points of C, we find, 
by a method similar to that of § 3, that the null geodesics through 


A é ive I J a 1 6 . 
are given by 24 Qy 924+ py Zt = 0, (6.1) 
3 
where Ppq = eet Yaet 2,% Yop Yoq 
= ‘9g Aglt,jk Apt AA*. (6.2) 


The invariant @ can now be found as before in terms of the solutions 
of the linear equations (6.1). 

If one of the vectors, say A,,;, is a solution of the parallel transport 
equations, we have y,, = 0 from (1.2). Also, it can easily be verified 
that for any vector A,‘ orthogonal to such a vector and to the curve, 


3 €4(Y30)* = ij Asin Agi? 5, AYA == @, 


* This has been proposed by Professor E. T. Whittaker as an alternative 
definition of spatial distance in a general space. Another definition by H. 8. 
Ruse, Proc. Roy. Soc. Edin., 52 (1932), 183, leads to this result in a general 


space. 











78 A. G. WALKER 
and hence, from (6.2), 
Poq = Une (p,q = 2,3). (6.3) 


Thus equations (4.1) are still true when only one of the vectors is 
given by parallel transport along the geodesic. 


7. The application to an important class of spaces 
We shall now consider the application of the foregoing methods 
to space of the common form* 
ydt? —y—dr?— rd? — r*sin?0 d¢?, (7.1) 
where y is a function of r alone. It is easily shown that, by a suitable 
choice of s, the equations of a typical null geodesic can be written 


in the form 


di > al, og dp _ 


a> on ae 


1” 


where = =; e= +1, 
=) 


and h is a constant. If the geodesic is through the origin, h is zero. 
A solution of the transport equations along this curve is 
A," = (0, 0, 1/r, 0), 
and a vector orthogonal to this vector and to the curve is 
A; = (0, —ehy/r, 0, p/r). 
The non-vanishing components of the curvature tensor are 


Rour0 = 2Y"> R220 = 3vy'; Rosso = 2ryy' sin*?, 


Lane, Baw — rE sin*s, Ros32 = —?2(y—1)sin26, 


Y 4 
and hence, from (1.4), (7.2), (7.4), and (7.5), we find 


h? ; 
D3. a (B°Y —yth 


(7.6) 


Ty = 0, 
h2 


33 = aa ("y’—y'). 


* This form includes some well-known spaces in general relativity, e.g. the 
gravitational fields of a material particle and an electron. 
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If the geodesic passes through the origin, h = 0, and from 


(7.7), 
ae =0 (p,q = 2, 3). 
The null geodesics through the point s = s, are therefore given by 


9 


eect a(8—89), a= B(s—S), 
and we have O = 8—&. (7.8) 
In this case r--s = const., and hence 0 = r+7o, |r—ry|, according 
as the star and observer are, or are not, separated by the origin. 

If h ~ 0, write 2? = rx, z? = rpy. Substituting (7.7) in (4.1), and 
changing the independent variable from s to ¢ by (7.2), these equa- 
tions become dx 


e 4+ (7.9) 


(7.10) 


Hence we have ‘ (7.11) 


(7.12) 


where 


and the subscript 4 indicates the value at the position of the star. 
The required invariant is now given by 
“a = Krrp( a sin(4—¢p), 
where = = lim Rae Ul o)sin (g— Po) 
8 ea 
We have 
de i lim sin(¢—$o) _ 1 lim v—" st a 
dp h = ¢+4, $—$o $$, P— bo Po 
Hence K = r5y,/h?, and we have 


£0? = 7 ri Yabo V—Tsin(b—dy). (7.13) 


It is interesting to observe that, in certain cases, this method of 
defining spatial distance will give unexpected results. These would 
be due to the term sin(#—¢,) in the above formula, for it may be 
possible for the null geodesics to be sufficiently curved so that ¢) and 
7+, give distinct points on the curve for some values of ¢9. In such 











80 SPATIAL DISTANCE IN GENERAL RELATIVITY 

a case, if the observer is placed near the point 7-+-¢,, the star would 
appear to be very bright, and the above definition would indicate 
only a small spatial distance. This could quite possibly occur when 
light from a star passes near another heavy body, and it should be 
possible to find two stars, satisfying the required conditions, such 
that the earth is near the critical point of observation. It is seen that 
the above peculiarities arise from equation (7.11), which shows that 
it is possible for a pencil of null geodesics to converge again in one 


direction. 
In conclusion we may remark that the above method of calculating 
spatial distance has been successfully applied to space of the more 


general form 
ds* = vdt?— R*(y—ldr?+-r*d6?-+-r*sin?6 d¢?), 


where F is any function of ¢, and v, y are any functions of r. 
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